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SEX DIFFERENCES IN MENTAL GROWTH AND 
DECLINE? 


H. 8. CONRAD, H. E. JONES AND H. H. HSIAO 
Institute of Child Welfare, University of California 


The present report is based on data from the Army Alpha intelli- 
gence examination, administered to five hundred eighty-one males and 
six hundred seven females in a representative rural group, between the 
ages of ten and sixty. Details of sampling and of test administration 
are given in a previous report (7). 

Prior work has shown that for the total sample, the developmental 
curve is linear to about sixteen years, with a negative acceleration 
beyond sixteen to a peak between the ages of eighteen and twenty-one. 
A decline follows, which is much more gradual than the rise during 
adolescence, but which by the age of fifty-five involves a recession to 
the fourteen-year level (as computed from a fitted or a smoothed 
curve (3, 7)). 

We are concerned here with the comparative analysis of the 
developmental data for the two sexes. Table I gives the number of 
cases in each age-sex group; Tables II and III present mean and 
median scores, respectively, for these groups. Means and medians 
are presented not only for performance in the total Alpha, but also for 
the “abbreviated Alpha,’ and for each subtest of the Alpha.* Table 


' The data for the present study were collected with the aid of a research grant 
from the Social Science Research Council of Columbia University. Acknowledg- 
ments are due to Professors R. 8. Woodworth and A. T. Poffenberger, of Columbia 
University, for advice and administrative assistance. 

2 The ‘‘abbreviated Alpha” is the same as total Alpha, except for the elimina- 
tion of the scores in subtest 8 (which measures ‘‘general information”) and subtest 
4 (which measures vocabulary through an opposites test). Due to the fact that 
at the upper ages, accumulated experience tends unduly to weight the information 
tests 4 and 8, we have considered the abbreviated Alpha to provide a better com- 
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IV gives the reliability of sex differences in each of these tests, for 
combined age-groups. From Table IV it can be seen that while the 
males are fairly consistently inferior in total score, the difference is 
reliable only in the age period ten to fifteen. In later adolescence the 
boys tend to catch up in mental test ability (in this respect the mental 
growth curves are similar to physical growth curves). Minor selection 
differences may influence the results at certain ages; for example, it 
appears likely that a slightly inferior selection of women was obtained 
at ages twenty-five to twenty-nine, due to the policy, in home can- 
vassing, of including all families with two or more children (7).! In 
Tables II-IV, a slight but unreliable tendency appears for sex differ- 
ences to diminish to age forty or forty-five, and to increase thereafter. 
This tendency, consistent in nearly all of the subtests, could be deter- 
mined by errors of sampling.? 


TaBLp I.—NvumsBeErR or Cases IN Eacu AGE-GROUP 





















































Age-group 
Sex 
10 | 11 | 12 | 18 | 14 | 15 | 16 | 17 | 18 
te 16 | 22 | 34 | 35 | 27] 31 | 42 | 29 | 27 
Fomale.......... is | 33 | 31 | 35 | 29 | 30 | 32 | 21 | 19 
19-21|22-24|25~29|30-34|35-39]40-44145-49150-54/55-59 Total 
a i 2 46 | 23 | 41 | 51 | 36 | 46 | 27 | 32 | 16 | 581 
ale 41 21 47 55 59 51 41 27 17 607 








parative measure of the intelligence of different age-groups, than the total Alpha 
test (2, 7). 

’ There are eight Alpha subtests, which may be briefly described as follows: 
(1) Oral directions; (2) arithmetical problems; (3) “‘common sense’’; (4) opposites 
or vocabulary; (5) disarranged sentences; (6) numerical completions; (7) verbal 
analogies; (8) general information. For more detailed description, see reference 
No. 11. 

1 We have shown elsewhere (5) that in this rural sample, lower intelligence is 
associated with earlier childbirth, although not with the total size of completed 
families. 

2 We should not, however, overlook the possibility that the growing discrepancy 
between males and females in the upper ages may be real. Using correlations 
between age and score, both Carter (1, p. 44) and Price (9, pp. 12, 58-59) have 
observed a slightly more rapid decline among men than among women. 
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Alph 
3.9} 3.8) 6.3) 6.4) 6.1) 4.5) 5.3) 8.8) 9.7) 8.3) 8.1 


3.9} 4.9) 5.7) 6.5) 6.6) 5.0) 5.4) 7.2) 9.4) 8.1) 8.6 


5.6; 5.8) 7.1 


6.1 
9.5) 11.5) 15.6) 11.5) 14.3) 8.0) 8.7/12.0/21.1/16.6)19.6 


7.8) 16.1) 12.3) 13.3) 11.1) 8.0) 7.2/14.5)13.7/23.2/18.7 
8.9} 13.0) 15.0) 10.7) 11.9) 7.9) 7.5)13.6)14.8/20.8)19.4 
7.8) 12.1) 15.8) 8.9) 13.3) 6.3) 7.3)11.3)12.6)17.1/21.9 
8.6) 16.4) 16.8) 12.1) 11.6) 8.0) 7.4/11.9)13.2/21.7/21.2 


9.1) 12.7) 14.5) 12.1) 13.2) 8.3) 7.9/20 
7.4) 14.1) 13.4) 9.1) 11.4) 6.7 


9.3} 12.7| 14.2) 12.0) 13.7) 9.3) 8.6/20 2] 
10.2) 13.8) 15.2) 11.9) 13.3) 8.3) 8.8)17.2/19.8/20.5)18.6 


10.1) 14.0) 18.7) 11.7) 15.2) 7.9) 8.8)16.4/22.9)19.4/21.2 


7.6) 10.5) 10.1 


10.3} 9.4 


Lee he 





4.8 


6.9) 6.8 | 5.8 
5.9) 8.0 | 7.4 
8.8) 8.8 | 7.3 
9.6} 9.0 | 7.9 


9.4; 8.8 | 9.1 


4.5 
10.0} 9.0 | 9.3 


5.6) 5.3 | 3.9 








9.3) 9.9 | 8.3 
10.6} 8.3 | 9.0 
9.7; 8.6 | 8.2 
9.1) 8.3 | 6.8 
9.6} 8.7 | 7.7 
9.0) 8.2 | 6.1 
8.3) 8.7 | 6.4 
9.5) 8.5 | 6.3 


6.7| 6.1 


4.9) 5.1 
10.2} 9.0 | 8.1 
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Total 
alpha 


mi|r{(m|r{m([ri[M|[r[M|[r[M|r[M|Fri[M|Fi[M|Fi[M|F 


92.8/109.6| 59.6) 71.3) 7.0 | 7.4 


95.0} 99.7) 61.4) 66.7) 6.9 | 7.7 
22-24 | 82.0)102.6) 54.0) 68.7; 6.4 | 7.1 


74.6| 76.7) 50.8) 55.3) 5.6 | 6.0 
71.8)100.0} 51.4) 69.5) 6.2 | 7.6 
93.3} 95.9) 63.7) 64.5) 6.8 | 6.8 
95.3} 98.3) 65.3) 67.0) 7.2 | 7.1 


54.1) 59.5) 38.4) 42.2) 4.9 | 5.5 
54.4) 67.9) 38.2) 49.8) 4.9 | 6.0 


41.3) 47.2) 31.3) 34.7) 3.8 | 3.6 
43.6) 48.9) 30.9) 35.2) 4.8 | 4.6 








* Score on total Alpha test, minus scores on tests 4 and 8. 
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Tasie II].—Mean Raw Scores or Five Hunprep Eiauty-onz Mauss (M), anp Sixx Hunprep SEvEN FEMALES (F) 
19-21 


Age- 
group 








25-29 | 98.4) 83.5} 60.1) 52.1) 6.2 | 5.7 
30-34 | 83.1] 90.7) 53.9) 55.7) 6.4 | 5.5 
35-39 | 75.8] 91.4) 47.1) 55.6) 5.4 | 5.5 
40-44 | 93.7| 90.9] 54.7) 53.7); 6.3 | 6.1 
45-49 | 78.7| 82.1) 46.6) 50.7) 5.4 | 5.8 
50-54 | 67.8) 97.2) 38.9) 57.1) 5.4 | 6.5 
55-59 | 72.5) 84.4) 43.8) 47.2) 5.9 | 6.1 
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6.6| 5.8] 4.5 5.3] 9.7] 9.5] 8.0] 8.3 


5.9| 6.4| 7.8| 5.4) 5.3) 7.0) 8.6) 8.1) 6.5 
6.3} 8.7] 9.5] 6.5) 4.7) 9.3)11.3) 9.5)11.5 


6.2| 4.8] 7.7| 8.6] 10.2) 5.5) 6.9) 9.8)14.0) 9.7/11.9 
7.41 11.0} 11.0) 11.1] 10.8} 7.6) 8.1/13.4)14.7/15.0)12.2 
10.0 9.8] 12.6) 11.9] 13.0} 6.4) 9.0)13.0/23.7/14.7|16.1 











Alpha subtest number 
9.4] 14.0] 14.3] 11.9] 14.0) 8.4) 7.7/20.5|25.0)19.4/16.3 


9.3] 13.0] 12.7] 12.0} 13.8) 9.2) 8.8/20.0)/23.0/19.3)14.5 
10.3) 11.6] 14.0] 11.5} 14.0} 8.8) 9.0)17.7)17.0)21 .3/18.3 
9.8} 10.8] 17.0} 11.3] 15.0) 7.7) 9.3)13.0:24.2)17.0/20.5 
9.2} 10.5] 14.5] 12.3] 14.7] 7.8] 8.2) 8.6/20.0/15.0|20.0 
7.5| 15.0] 12.6) 12.7) 11.3} 8.6) 7.5)12.0)11.5/24.2/19.7 


8.5] 12.7) 13.3) 10.9} 11.9) 9.0) 7.8/12.0)11.5/20.5)19.2 
8.2} 11.0] 15.0} 8.0} 13.8) 6.8) 7.7) 7.7|11.3/14.0)22.1 
8.4] 16.3} 16.2} 12.6) 10.7) 9.3) 7.7) 8.7) 9.023.3/21.4 


7.5| 14.6) 14.2| 9.5] 12.2) 7.6) 6.3) 9.8) 9.8)17.0)19.3 
8.7| 8.5] 22.3] 7.5] 15.5) 6.3) 9.1) 4.6)12.0/16.0/23.6 


8.0; 7.0) 17.4) 10.5} 11.5) 2.7| 6.7) 7.0) 7.8)/17.0/21.4 


3.9} 3.2) 6.1 


4.0) 4.1 
5.8) 5.1 





6.9 
8.8 








4.5) 4.7 | 4.3 
5.7; 5.2 | 3.9 


6.8) 6.3 | 4.5 
7.5) 7.4 | 5.6 


6.0) 8.1 
9.5} 8.8 | 7.4 


8.8} 8.9 | 8.5 


10.5) 8.1 
9.1) 8.4 | 6.3 


9.0} 8.6 | 7.4 
10.0) 9.2 | 8.0 
9.0} 9.2 | 8.2 
10.2} 8.7 | 9.5 
9.9} 8.4 | 8.0 
10.0} 8.6 | 7.4 
9.3) 7.7 | 5.6 
8.0} 8.8 | 5.5 
8.6) 8.4 | 5.7 














41.5) 5.7 | 6.0 
49.2) 5.4 | 6.6 
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81.0) 80.6) 53.8) 54.2) 6.4 | 6.6 





Abbrevi- 
ated 
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Total 
alpha 


wirlmMlrlmM|rlm|rl{[ml|rl[m|rf[M|ri[M|F[M/P iM 


92.5) 91.7) 62.5) 58.5) 7.4 | 8.3 
75.0/101.7| 48.6) 72.1) 7.8 | 7.8 


40.0) 52.5) 30.0) 34.4) 4.8 | 4.9 
71.3) 96.0} 52.1) 68.8) 6.8 | 8.2 
92.5} 90.0] 63.3) 67.5) 7.4 | 7.8 
92.5} 92.5) 66.3) 67.5) 7.3 | 7.6 


40.0} 43.3) 32.5) 31.7) 3.8 | 3.9 
50.0} 55.0) 35.0 


50.8) 66.3) 37.5 








Age- 
group 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19-21 





30-34 | 80.0) 81.4) 52.5) 48.0) 7.4 | 6.2 
35-39 | 65.0) 88.3) 41.7) 53.2) 5.6 | 6.0 
40-44 | 93.3] 89.0) 53.0) 47.5) 6.8 | 6.3 
45-49 | 67.0] 80.8) 40.8) 48.2) 6.2 | 6.3 
50-54 | 54.0/101.7| 31.0) 56.3) 5.7 | 7.3 
55-59 | 60.0) 81.7) 40.0) 42.5) 6.0 | 6.5 


22-24 | 78.3/105.0| 58.8) 67.5) 6.8 | 7.6 
25-29 | 93.0) 83.6) 54.2) 51.1) 6.6 | 5.8 
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TaBLB IV.—RELIABILITY OF THE DIFFERENCE BETWEEN FEMALE AND MALE 
PERFORMANCE ON THE ARMY ALPHA TxsT* 



























































Total Abbreviated Alpha subtest number 
alpha alpha 1 

pom 2 3 
group | Mean | Relia-| Mean | Relia~} Mean | Relia-| Mean | Relia-| Mean | Relia- 

differ-| bility | differ-| bility | differ-| bility | differ-| bility | differ-| bility 

ence* |index*| ence | index} ence | index} ence | index | ence | index 
10-15 9.8 5.3 7.5 6.1 55 3.4 .29 1.6 .66 3.5 
16-21 7.1 2.1 5.3 2.5 . 28 1.4, —.88)| —3.4| 1.28 4.1 
22-39 | 4.5 1.5 2.6 1.4) —.26) —1.4)—1.10) —4.5 .24 9 
40-59 9.1 2.6 5.5 2.6 .42 1.9| —.65| —2.0) 1.38 4.8 
10-59 | 8.3 6.2 5.8 6.8 .27 2.8| —.41] —3.5 .81 6.4 

Alpha subtest number 
4 5 6 7 8 

Age- 
group | Mean | Relia-| Mean | Relia-} Mean | Relia-| Mean | Relia-| Mean | Relia- 

differ-| bility | differ-| bility | differ-| bility | differ-| bility | differ-| bility 

ence | index | ence | index] ence | index| ence | index] ence | index 
10-15 1.6 3.8 1.3 4.2 .61 2.6, 3.0 5.5 .65 1.3 
16-21 2.4 3.2 2.0 4.0 .10 .3| 2.8 3.0) —.17| —.2 
22-39 8 1.1 i | 2.5 .00 . oe 2.0| —.17;| —.2 
40-59 2.3 2.6 1.8 3.4 .35 - ee 2.1; 1.02 1.2 
10-59 1.7 5.4 1.5 7.0 .30 2.2} 2.4 6.6 .39 1.2 



































* In each case the mean score of the males has been subtracted from the mean 
score of the females. The derivation of the figures in the table may require a word 
of explanation. The basis of the present table is found in TablesI, IIand V. For 
each of the eighteen age-groups in these tables, we obtained the mean difference 
(female minus male), and the PE of this difference by the usual formula (6, p. 133). 
Then, for the group including the ages ten, eleven, twelve, thirteen, fourteen, and 
fifteen, the mean difference at each age was weighted inversely as the square of its 
PE and the weighted mean difference for the group aged ten to fifteen was thus 
obtained (cf. 8, p. 325). (The weighted mean difference is given in the table under 
the heading ‘‘ Mean difference.’’) A similar procedure was employed to obtain the 
weighted mean difference for the groups aged sixteen to twenty-one, twenty-two to 
thirty-nine, forty to fifty-nine, and ten to fifty-nine, respectively. The PE of the 
weighted mean difference was then obtained by the method recommended by 
Searborough (10). The reliability index is the weighted mean difference divided 


by its PE. A ‘‘positive” reliability index is here used to indicate female superiority; 
a negative index, female inferiority. 














RRS PR ST 


166 The Journal of Educational Psychology 


Whatever their importance with reference to other problems, 
sampling factors do not disturb the comparison between age-sex 
differences in the various subtests. In four tests (tests 3, 4, 5 and 7) 
the males are rather consistently inferior. These are all strongly 
verbal tests. In test 1 (oral directions) the males are in early adoles- 
cence as inferior as in the more highly verbal tests; but at later ages 
they acquit themselves relatively much better, overtaking the females 
in the period of middle maturity, and again dropping slightly behind 
in later maturity. Test 8 (information) is one which has commonly 
been regarded as favoring males; the information items selected 
undoubtedly in many cases bear selectively upon male interests and 
occupations (11, pp. 227, 234). Our obtained sex differences are rela- 
tively slight in this test, and with frequent intersections of the growth 
curves. In test 6 (number series completions) a similar condition 
occurs—intersecting curves, with small differences, readily attributable 
to sampling fluctuations. But the similarity of the results for tests 6 
and 8 can hardly be due to similarity of content. Test 6 is in fact 
more similar to test 2 (arithmetic problems). In this subtest we find 
the only instance in which males are fairly consistently superior to 
females. The male superiority does not clearly emerge until later 
adolescence, but thereafter is in striking contrast to the results for the 
verbal tests 3, 4, 5, and 7. This is probably an illustration of a sex 
difference which has been reinforced by differential practice and mental 
attitude (4). Tasks of computation fall traditionally to men, in 
connection with buying, trading, and planning works of farm con- 
struction. The farm wife describes herself as ‘‘no hand at figuring.’”! 

It is clear, then, that by the manipulation of test material, striking 
changes could be produced in obtained sex differences. A heavier 
weighting of arithmetic elements would bring the male averages up to 
or above those for the females. A test made up largely of selected 





1 Possibly this division of tasks and responsibilities may be taken as a symptom 
of a patriarchal system surviving longer under rural than under urban conditions— 
the male of the household being the chief executive, secretary, and treasurer. 
In modern city life, the functions of budgeting and accounting are now shifting 
to an increasing degree to women. In considering these speculations we shou!d 
not, however, neglect another hypothesis, which attempts to explain sex differences 
in arithmetical abilities in terms of inherent factors; women are naturally more 
proficient in the use of language; men excel in motor abilities, in handling objects, 
and in quantitative manipulation of all sorts. As yet, however, we are hardly 
prepared to do more than conjecture concerning the relative contributions of 
culture and of original nature in the development of these abilities. 
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items from subtests 3, 4, 5, and 7 would manifest an even more con- 
vincing female superiority, in the same group of subjects. Sex differ- 
ences vary both in direction and degree according to the nature of the 
test. And whatever the role of native factors, it seems clear that 
differences in interest and practice—not themselves hereditarily 
determined—must also be considered. This is in agreement with the 
conclusion previously reached, by an analysis of sex differences on 
individual items of other tests administered to the same sample (4). 

Taking the total Alpha for whatever it is supposed to measure, it 
must be conceded that the sex differences which we have shown are 
more largely of a theoretical than a practical significance. In early 
adolescence, the actual difference in Alpha score is only about one- 
third of an SD. This drops to one-sixth in later adolescence and to 
one-ninth in early maturity, increasing again to about one-fourth of 
an SD in later maturity. 

Some interest attaches to sex differences in variability. These 
differences are hardly large enough to deserve statistical analysis, but 
in certain age groups they show a fairly marked consistency (Table V). 
In both sexes, marked increases in variability occur throughout the 
period of adolescence. This increase, however, appears to be greater 
for boys than for girls. After adolescence, the variability of males 
tends to hold constant, or to increase slightly in later maturity, 
while the female variability tends to diminish slightly. This latter 
would be in agreement with the assumption of a slightly superior 
selection of women after the age of forty-five. 

In the range of ages from ten to sixty, the impressive fact is not the 
degree of sex difference, but rather the similarity of the developmental 
curves for the two sexes. 


SUMMARY 


The Army Alpha intelligence examination was administered to 
five hundred eighty-one males and six hundred seven females in a 
representative rural group between the ages of ten and sixty. A 
general slight superiority of females to males is observed; this supe- 
riority (as in physical growth curves) is greater during early adolescence, 
but is at no age sufficiently marked to be of great practical significance. 

The sex difference is by no means uniform among the individual 
subtests of the Alpha. In four strongly verbal tests (‘‘common 
sense,”’ opposites, disarranged sentences, and analogies), the males are 
rather consistently inferior; in two tests (numerical completions and 
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general information), the sex differences are relatively slight, with 
frequent intersections of the developmental curves; in one test (arith- 
metic problems), the males are quite definitely superior. 

These results confirm conclusions reached by an item analysis of 
other tests administered in the same sample. It is clear, then, that 
the direction and extent of sex differences in a composite mental test 
is dependent upon the composition of the test and the weighting of its 
parts. Further investigation of “‘intelligence”’ differences between the 
two sexes should be made in terms of specific functions or items, rather 
than in terms of unanalyzed total scores. 
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THE COLLECTING INTERESTS OF TOWN CHILDREN 
AND COUNTRY CHILDREN 


PAUL A. WITTY AND HARVEY C. LEHMAN 


Northwestern University and Ohio University 


In two articles which appeared recently in this JouRNAL' the 


‘writers discussed certain phases of children’s interest in collecting. 


In the first of the articles, data regarding the extent to which town 
children report interest in collections were presented. In addition, 
the problem of terminology was discussed. The fact was emphasized 
that ‘collection’ is a popular term which lacks precise scientific 
meaning, and that when attempts are made to employ this term 
scientifically, vagueness and ambiguity often result. In the second 
article, salient sex differences in collecting interests and activities were 
presented with the writers’ interpretation thereof. This article deals 
with another aspect of collecting; data will be presented comparing 
the collecting interests of town and country children. 

In Whither Mankind,? C. A. Beard and others have ably discussed 
certain of the changes in man’s mode of living that have been pre- 
cipitated by the advent of the machine age. This book, and most 
others of its kind, barely touches upon the changes in child life that 
have been brought about by the industrial revolution. This neglect 
is due, doubtless to several facts; among these the lack of pertinent 
data is perhaps the most important. Nevertheless, a moment’s 
reflection concerning the enormous social and economic changes which 
have accompanied the industrial upheaval will bring to mind many 
conditions that must inevitably be reflected in child life. For example, 
less than a century ago, in 1860, only three per cent of the people of 
the United States lived in cities. At the present time, approximately 
half of the population of the United States resides in cities. Such an 
unprecedented shift in the habitat of the population has brought 
numerous problems of adjustment for children and for adults. In 
his introductory speech before the White House Conference, President 
Hoover set forth the following opinions. 


In the last half century we have herded 50,000,000 human beings into towns 
and cities where the whole setting is new to the race. We have created highly 
congested areas with a thousand changes resulting in the swift transition from a 
rural and agrarian people to an urban, industrial nation. Perhaps the widest 
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range of difficulties with which we are dealing in the betterment of children grows 
out of this crowding into cities. Problems of sanitation and public health loom 
in every direction. Delinquency increases with congestion. Overcrowding pro- 
duces disease and contagion. The child’s natural play place is taken from him. 
His mind is stunted by the lack of imaginative surroundings and lack of contact 
with the fields, streams, trees and birds.*:* (Italics not in the original.) * P. 274. 


At a recent conference of educators, Norton described some 
values which the typical farm child of a generation ago gained from 
his out-of-school life. 


He performed the chores with which the farm aboqunds. As soon as he was old 
enough, he aided in such fundamental farm operations as sowing, hoeing and 
harvesting. When the horses needed shoeing he drove with his father to the 
blacksmith shop and saw the glowing iron fashioned at the forge and fitted to the 
horse’s foot. The milk bottle did not mark the limit of his knowledge as to 
the source of milk. He did not have to go to a zoo to seeacow. He knew all 
about their habits, and the chores connected with them from first-hand experience. 
Nearly every hour of the day brought him increased understanding of, and training 
for, life. He knew how real work was done, and learned how to do it. 

Compare this situation with that of the child living in the congested city. 
There is much less in the typical urban community that children can do. The 
grocer brings the eggs, machines milk the cow, and the baker bakes and delivers 
the bread. Actual participation in the work of the city is prohibited by child 
labor laws; and it is difficult to observe what goes on even at second hand, for 
“keep out” is posted in large letters at the entrance to the factory and shop. 
Often turned loose with nothing to do, children have to choose between play upon 
the crowded city street, association with the back alley hoodlum, or attendance 
at the cheap movie show.*:? * P. 274, 


With increased urbanization of America’s population, it becomes 
increasingly important for the educator to know as much as possible 
about the modern child’s needs. Although it is impossible to ascertain 
in detail the differences in the environments of children who lived 
forty or fifty years ago and of those today, it is possible to discern 
significant differences in the environments of present-day town and 
country children. These data should be suggestive of differences which 
have been brought about (in part at least) by urbanization. The 
environmental differences of present-day town and country children 
may be set in sharp relief by behavior studies, one type of which is 
concerned with the child’s collecting activities. Although the present 
study does not include data for children living in congested city areas, 
it does present comparative data for children living in a small city and 
for those living in a genuinely rural environment. 


——— 
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METHOD OF INVESTIGATION 


A check-list of one hundred ninety items was presented to large 
numbers of town and country children. The procedure that was 
used for assembling the one hundred ninety items and the directions 
which were given to the pupils may be ascertained by reference to a 
former article. The town children lived in Lawrence, Kansas, a 


university center having a population of approximately 15,000. 


Data for rural children were obtained from the rural schools in Medina 
County, Ohio. 


RESULTS 


Table I presents the average number of active collections reported 
by rural children of various school grades. Comparison of these 
data with those presented previously for town children makes it 


TaBLe I.—AveracGeE NuMBER or ActTIvE COLLECTIONS REPORTED BY COUNTRY 
CHILDREN OF VaRIOUS ScHOOL GRADES 











Number of Average Number of Average 
School grade bene number of gists number of 
collections collections 
2 81 8.22 54 9.67 
3 118 13.54 127 19.73 
4 149 13.85 118 19.43 
5 157 19.74 143 23.43 
6 127 10.69 116 12.48 
7 108 11.99 101 15.43 
8 72 9.41 57 9.26 
9 67 5.42 54 8.20 
10 63 6.11 50 12.52 
11 29 6.52 35 7.43 
12 37 6.51 27 6.22 
sR eee oe”)| | fee 882 

















evident at once that the rural child collects many more objects than 
does the city child. This fact is revealed by several lines of evidence, 
some of which will receive later comment. 


_.» «> 2 tate teh, tet Gt 





Collecting Interests of Children 173 


It will be noted from Table I that the largest average number of 


active collections is found for rural children in the fifth grade. In‘ 


studies of town children it has been found rather consistently that the 
largest average number of collections is made by children of age ten. 
This finding suggests that the country child matures somewhat more 
slowly than does the city child; typical rural children sometimes are 
found to be slightly older than city children for their respective grades. 

Table II presents items which were collected two or more times as 
frequently by country boys as by town boys. For the convenience of 
the reader the items of Table II have been classified in ten categories. 
It will be noted at once from Table II that the frequency distributions 
for the first three categories (botanical specimens, animal parts and 
insects, and rocks and minerals), give clear evidence that the rural 
child has closer contact (and doubtless better acquaintance) than does 
the town child with objects of nature. This fact is of course one that 
would naturally have been anticipated. 

Certain of the items.of Table II, particularly those listed under 
the heading, ‘‘ Miscellaneous,” will doubtless appear to the reader to 
be of a rather trivial nature. Many of these resemble possessions 
more than collections. This fact might lead one to conclude that the 
great frequency of reported collections among the rural children may 
be due to misunderstanding of the term “collection.” The thought 
may occur to the reader that the rural child reported not only his 
collections, but also provided an inventory of his possessions. Never- 
theless, several analyses of the data indicate that the rural child 
actually collects much more frequently than does the town child. 
For example, Table II presents seventy-eight items which were 
collected two or more times as often by country boys as by town boys. * 
Although seventy-eight items were collected two or more times as 
often by country boys as by towns boys, only one item (gopher skinsT) 
was collected more than twice as often by the town boys as by the 
country boys. The data for the girls are similar to those of the boys 
in this respect. (See Table III.) Of the items included in the check- 
list, one hundred nineteen were collected two or more times as often 
by country girls as by town girls. Only two items (jack-stones and 
toys [great variety]) were collected two or more times as frequently 





* Five items of Table II were collected slightly less than twice as often by 
country boys as by town boys. These five items were included in Table II because 
they seemed to fit well into the general categories of this table. 

+ This doubtless is due to the fact that gophers are rare in Ohio. 
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Tas.Le I].—ITems CoLuectepD Two or More Times as Orten By Country 
Boys as BY Town Boys 











1008 Country 808 Town 
boys, per cent | boys, per cent 
(a) Botanical 
Diu gidenhie weaee anise eks cicedeseeeeeeneue 7 3 
ee eee ine heaeukeebent an 15 6 
Pb bhaeeubhanens eodddeeine bouesdccnssuceosedsescons 5 1 
PD ccccpescesedesvetesaneseuesdeeedsoons 3 1 
as ak dead 3 1 
tee siiithbetabendadeeess ees oen6eens sone 2 1 
DMs. siGhst ned debdatsesewebetatssedsoesadned 3 1 
(b) Animal parts, insects, etc 
Decks Wub a bead ebbb hss 00e0e Gbvbss she 0a54oss sses 6 2 
ee eae tease beanie betaneb nade s 4 
sco geeancadbeveescresessésbeses 3 1 
i ee eed reece e ae ee Ohn wee dame ee ke Oa 8 3 
EE a 4 1 
ti ceh ee eine cba hed ekdenebades see eveenue 9 3 
i ictbcbnekneneeeees cbetedwatee¥eessevres 2 1 
is Sia Sid Baca a aad os aaa aaa us leans HS Be ae 5 2 
chad dada eee eae EWC RAO ea ee RW Odeo ee 2 1 
Dt Cecmiet anew ecaaie 606 he éaweiddeseneves vende 6 3 
I iat Sik whe Kak tka hk ae ene wee ano’ 5 1 
ee eee ta bbeh ahaha tiwidebekee 4 0 
(c) Rocks and minerals 
ia cae de Ani deodiey raid a dua le ale ate ER oA md 5 2 
i thodan ddewdweswiodde asthe seeds eus 6 b0b<eds oes 17 6 
ED. a to ede ane aie eae kPa > ws pa Kae 6 3 
NS is oc dik. eee Kea ke Sb hOS ROSH ON ROS ee Kite 4 2 
Dc che eeaknevevegnedeseeséndsabcacaceueskees 12 6 
EE ES eT ee EE ee ey ey oe 3 1 
a 5. cos ating biabete a aid he we eae eee 12 5 
i. ana bacb del we tdbndeeseneasedentdveees 4 1 
(d) Salable items 
i scceeheewnedhedadadesane wahdeeebenboesieeuns 13 6 
NG Teta adied siti hit be heed mabhw ab ede bab weed 3h aes 6 2 
il aac dae gid cial Stak a hh hi Gok as MA wae ee 7 3 
el Ee ee a te ee eege tha eew arene 18 2 
ns isn wala ea eee 2 ae dew bauee | 16 1 
EE Tr ee 5 0 
(e) Objects associated with hunting, fishing, hiking, etc. 
i tLe ewe bhGebeh weed Keoebie et eae 27 6 
ss ckvcis ease ees ane seebbesoeed 17 9 
Dh cictnb deste tens tevetsedetenerens veseeés 14 9 
td ch benenh enedsudeeesbebedentesdies 8 5 
oe it old cia oe Ca a bene gd bake Re ae wee ea aK ea 14 9 
aa wlathrblyika pitas dae 5 OA ae oe Ree ENS hve eee 9 4 
ES co eK ORed Awd eS kee a Raabe eae se 2 1 
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1008 Country 
boys, per cent 


808 Town 
boys, per cent 





(f) Objects for personal adornment 
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(9) Objects which possess sentimental value 
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(kh) Souvenirs from the schoolroom 
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(i) Objects essential to certain types of play and recreation 
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Grats as By Town GIRLS 





882 Country 
girls, per cent 


868 Town 
girls, per cent 





(a) Botanical 


EEE es ee, oe 
ang (kinds of). . 


Moss (kinds of) . 

Nut shells (kinds of). 
Oak balls. . 

Wood (samples of). 
Rose petals. . 

Trees (samples of). 


(b) ae parts, insects, etc. 


Noe 


CANE OAAANOK AD 


Birds’ oe a and claws. 


Birds’ nests. 
Birds’ wing: | 
Butterflies. 

Cocoons.... 
Dragon flies 


Rabbit ears. 
Snails...... 
Skins.... 
Spiders. 
eeth...... 
Rabbit feet. . 


(c) Rocks and minerals 


Sand, colored... 
Shells 


Stones (kinds of) . Pies 


Soil (samples of) . 


(d) Salable items 
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882 Coun 
Ryan A 


868 Town 
girls, per cent 





Buttons, pictures. . es 
Buttons, cuff. . oh 
Neckties (No. ‘of kinds) . 
aoe RSs és w higher ® 4 


Pins, ie tee Le 


(g) Souvenirs from the schoolroom 
Drawings...... , . 
Compositions. . ae 

School books.............. 


(a) ae eeeential to certain types of play end recreation 


Doll buggies. 
Dogs (toy).. 
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Marbles. . 
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Tops. 3 
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Song books (kinds of).. 
(i) Miscellaneous 
Glasses (pieces of). . 
Blank bank checks. . 
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by town girls as by country girls. Very few items were collected more 
often by town children than by country children. (See Tables IV 
and V.) The data appear to present rather conclusive evidence of 
the relatively great interest of the country child in collecting. 


TaBLeE 1V.—ItTems Co.LLectep SOMEWHAT Morr OrtEN BY Town Boys THAN 
BY Country Boys 








808 Town boys,| 1008 Country 

per cent boys, per cent 
sack ark ks ee gia pbs Saskia a ye 0 hes ou 8 7 
eo ia id eee ae aks wechtonil 4 3 
sabe hk ine ch baea been noone ews 6 4 
hace eee Os ees cee in es wiew a 4 3 
EEE Oe a ee ee OE 4 1 
ACER ee ee 30 22 
Di cneh'e chGeheteeesand eedekenssoas 17 9 
CR Gkkdi b556 bb paves e eackaseiaeondee 5 4 
Ns nck thin eenacbehceeedaebesaeee 6 4 
tet e Redken eeia ev eeeeeseda ben 18 13 
i icin banaedeseeubekwkekoee 4 3 











* Both gophers and rabbits are much more common in Kansas than in Ohio. 


The fact should perhaps be mentioned that among the rural 
children, the sex differences in collecting interests were found to be 
almost identical with those which the writers have reported previously 


TaBLeE V.—ITemMs CoLuecTED SoMEWHAT More OrtrEen BY Town GIRLS THAN 
BY Country GIRLS 





868 Town 882 Country 
girls, per cent | girls, per cent 





ID, nccncecncesscevasenes 14 13 
REL oedeeks ed dues ene Read pee ke 9 4 
SE ee re 11 g 
Me II III Do nn cw rcccccccccsccecces 4 1 











for town children.’ Instances in which a previously reported sex 
difference was reversed are few indeed. The validity of the general 
categories employed previously for describing sex differences in 
collecting interests are corroborated therefore by the present findings. 
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e In a study that was reported in 1900, C. F. Burk wrote as follows: 

V 

if It will be seen that the boys concentrate more on a few things which run as 
crazes through very many of them . . . nearly half of them had collected birds’ 


eggs. Nothing attains such widespread interest among the girls, although between 

thirty and forty per cent of them had collected stamps, shells and picture cards, 
N but a greater number of things have considerable runs with the girls. The table 

shows that only seven things were collected by ten or more per cent of the boys— 
- while sixteen things were collected by ten or more per cent of the girls. P. 186. 


‘ Tables VI and VII show that the writers’ studies fail to corroborate 
- Burk’s finding that ‘‘the boys concentrate more on a few things which 
seem to run as crazes.”” Nevertheless, these data do support Burks’ 
finding that ‘‘a greater number of things have considerable runs with 


TasBLE VI.—Extrent To Wuicn Town AND Country CHILDREN ‘CONCENTRATE 
on A Few Tuinos Wuaicu Seem To Run as CrRAzES”’ 





Town Country 
children children 





Boys | Girls | Boys | Girls 




















" Largest per cent collecting any one item........... 30 28 27 28 

Number of items collected by ten or more per cent 
‘a "RRR RE Re ea 6 | 23 | 30 | 55 
ly Number of items collected by twenty or more per 

I vikvccnccedececcesnsdewcecs 2 2 4 11 

AN 
_ the girls.” But the foregoing finding is probably due merely to the 
. fact that girls exceed boys in the average number of collections made. 


it In Table VI it will be noted that thirty items were collected by tenor 
-_ more per cent of the country boys and that only six items were 
collected by ten or more per cent of the town boys. One might con- 
clude from this that ‘‘a greater number of things have considerable 
runs with” the country boys. This conclusion would be unwarranted 
and extraneous because the average number of collections of rural boys 
exceeds that of city boys. 


eX Burk’s report that ‘‘the boys concentrate more on a few things 
ral which seem to run as crazes,’’ was due probably to local or to temporary 
in conditions. Miss Whitley failed to corroborate this finding*:* and the 





g8. *P. 254. 
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writers’ data also do not substantiate it. When the present data are 
partitioned by school grades, one sees that, in nine out of eleven grades, 


TaBLeE VII.—Larcest PERCENTAGE OF CountTRY Boys AND Country GIRLS 
Co.tuectina Any ONE ITEM 











po Most popular item bined A Most popular item a, 
2 | Marbles (also tinfoil). . 29 | Story books (also hdkfs).... 40 
3 | Marbles... bie hikdhaa’ Se ccs caneeieee aa 34 
4 Sait .............. 26 | Valentines................. 34 
5 | Arrowheads.............  - , er ere 40 
6 | Arrowheads............. Ee 27 
OT dese nasaes can Oe cot naceeeen en 34 
8 | Arrowheads............. 28 | Drawings (different kinds). . 28 
9 | Arrowheads............. fee ee 35 

10 | Arrowheads............. ann ab hace dee ke 48 
11 | Arrowheads............. 24 | Souvenirs (also photographs) 40 
12 | Arrowheads............. 26 | Dance programs............ 40 














TasBLe VIII.—Irems CoLLectEeED More Orten spy Town Boys THAN By Town 
Girts, BuT More Orren sy Country Girts THAN BY Town Boys 








868 Town | 808 Town | 882 Country |1008 Country 
girls boys girls boys 
a aie eee 1 2 5 5 
Sa 2 4 5 8 
EL, occ toe asccewn 2 3 8 8 
ren ee Pere 1 3 5 4 
Rubber.... Sidrde aiNab ee kei 1 3 4 7 
Tobacco tags.......... ae 0 1 2 3 
Arrowheads... 1 6 10 27 
a OF Eck Gad wen 1 6 8 17 
icc aoe ans owe 3 9 27 16 
eee 0 1 2 2 
Rubber sheets.............. 0 1 2 2 
Sacks (kinds of)............ 1 2 3 4 
Tags (shipping)............ 0 1 2 2 
III. a 6 oa sk ee eaves 1 2 3 3 
Matches......... 1 2 3 5 

















the country girls ‘‘concentrated more” than did the country boys in 
collecting certain objects. (See Table VII.) 
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The fact is perhaps worthy of mention that certain items which 
were collected more often by town boys than by town girls were 
nevertheless more frequently collected by country girls than by town 
boys. (See Table VIII.) Similarly, certain items which were 
collected by town girls more often than by town boys were collected 
more often by country boys than by town girls. (See Table IX.) 
Such findings suggest that environmental influence is in some instances 
more potent than is sex in determining the extent of collecting interests. 


Taste [X.—ItTems Co.LuecTtepD More Ortren sy Town Giris THAN BY TOWN 
Bors, sut More Orren sy Country Bors THan sy Town GIRLs 
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It is difficult not to speculate regarding these data. For example, 
why should the country child collect so much more frequently than 
the town child? Numerous factors of course are operative. It is 
the belief of the writers that every child has to work out an adjustment 
to his environment by trial-and-error experience. Since it is often 
difficult for the country child to find congenial playmates of his own 
age, he is obliged often to engage in individual endeavor. Collecting 
is an activity that is easily accessible and apparently congenial for the 
country child. And the rural child avails himself of this form of 
amusement with unusual frequency. 

Nevertheless, many other factors are doubtless operative in 
stimulating the urge to collect. Possibly in his collecting, the rural 
child is simply imitating to a goodly extent the adult life that forms a 
part of his social environment. There is frequently space on the 
farm for worn-out or partially used articles of various sorts to be stored. 
Since the junk-man rarely makes his rounds in the country such articles 
do accumulate. A tendency toward habitual thrift may also be a 
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causal factor of no little import in producing the country child’s 
excessive interest in collecting. Furthermore, in some instances the 
child may be merely following the line of least resistance, since it may 
be easier for him to allow objects to accumulate than to destroy them. 


PEDAGOGICAL IMPLICATIONS 


Collecting is frequently an individualistic, competitive type of 
behavior. The child rarely engages in collecting as a member of a 
group or of ateam. He often tells about or displays “‘my” collection 
rather than ‘‘our” collection. He collects frequently because he sees 
other children collecting and his goal is likely to be that of obtaining a 
larger or a more valuable collection than his rival. 

What effect this individualistic, competitive type of behavior is 
likely to have upon numerous attitudes of the rural child is a matter 
of conjecture. It seems plausible, however, in the light of the laws of 
habit-formation, that the collecting endeavor of the rural child might 
make of him an individualistic adult. Today many assert that the 
American farmer is inclined to be individualistic in his outlook and in 
his attitudes—too individualistic for his own good. The charge is 
often brought that farmers are incapable of codperating politically 
(and also in other ways). Whether this charge be valid or not, the 
data in this paper suggest that geographical isolation might well be 
a factor that makes for individualism. 

A foremost objective of the modern school is expressed by the 
single word, ‘‘socialization.”” This being the case, it seems plausible 
that, if interest in collections is a fair sample of his generalized behavior, 
the rural child is in even greater need of a socialized school program 
than is the town child. The frontiersman, the pioneer, and the 
farmer have doubtless lived in environments that made for individual- 
istic outlook. Their children have also been subjected largely to 
similar influences. But if the modern child is to live his adult life in an 
urbanized environment, it is necessary that he learn coéperation. 

In other places the writers have pointed to the differences between 
the play activities of town and of country children. Some of the 
differences are probably to the advantage of the town child; some of 
them probably favor the rural child. Country children were found to 
engage more often than town children in play activities such as the 
following: 
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Blackman (Pom-pom-pullaway) Visiting or entertaining company 


Horse-back riding Teasing somebody 

Hunting Throwing rocks or stones 

Shooting Swinging 

Andy-over Wading in the water 

Teeter-totter Climbing porches, trees, fences, posts, etc. 
Gathering nuts Using a hammer, saw, nails, etc., for fun 
Hop, skip, and jump Cutting paper things with a scissors 
Looking at pictures Drawing with pencil, pen, chalk, or crayon 
Just singing Playing with pet dogs, pet kittens, etc. 
Whistling 


Town children were found to engage more often than country 
children in the following: 


Football Going to the movies 

Basket-ball Listening to the radio 

Riding a bicycle Doing gymnasium work 

Playing cowboy Watching athletic sports 

Boxing Going to parties or picnics 

Marbles Looking at the Sunday “funny paper” 
Roller-skating Playing robber and police 

Playing Indian 


Very interesting in this connection is the fact that town life tends 
to suppress certain natural modes of self-expression, for example, 
whistling and singing. Whistling and singing were found by the 
writers to be participated in by country children‘much more commonly 
than by town children. Although rural life seems to permit freer 
expression to certain natural modes of behavior, it affords less oppor- 
tunity for participation in certain organized group activities.* To the 
foregoing statements the following may be added, viz., rural life 
permits the child to maintain his contact with the objects of nature 
that are listed in the first three categories of Tables II and III. Rural 
life tends also to direct the child to individual endeavor. One of the 
outcomes of the latter fact is a marked tendency on the part of the 
country child to make collections. 
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DISTRIBUTIONS OF SCORES BASED ON CORRECT 
GUESSING FOR ee E TESTS OF VARIOUS 
LENGTHS 


WILLIAM C. F. KRUEGER 
College of the City of Detroit, Detroit, Michigan 


The purpose of this study was (1) to determine experimentally the 
distributions of frequencies based on the number of correct guesses in 
‘“‘true-false tests” of various lengths; (2) to find a practical length 
which would eliminate the probability of getting a high score by chance 
guessing; (3) to observe the function of the formula, “Right minus 
Wrong equals True Score,’”’ when applied to scores obtained from 
tests of various lengths. 

The subject received a sheet of paper containing either 10, 20, 30, 
40, 50, 60, 80, 100, 150, 200, 300, or 500 serially numbered lines. He 
was asked to imagine that he was writing a “‘true-false test.”” Each 
line on the page was to represent a statement which he was to judge as 
“true” or “‘false.”” Since the line could not furnish a cue with respect 
to the truth or falsity of the test item, the subject was forced to guess. 
(The straight line was chosen instead of the worded statement because 
the worded statement might suggest an answer to the subject.) In 
some of the tests the subject was asked to mark in random order 
fifty per cent of the statements ‘‘true”’ and the rest “false.’”” He was 
instructed not to use any scheme in marking, for example, he was 
not to alternate the “true” and “‘false’’ answers. For the other tests 
he was requested to mark the answers at random as “true”’ or “‘false”’ 
without keeping count of the frequency of the “true” or “‘false”’ 
signs. Again no scheme in answering was to be followed. 

All papers were scored by a key in which fifty per cent of the 
answers, chosen at random, were ‘‘true”’ and the rest “‘false.’”’ The 
frequency distributions of the number of items guessed correctly as 
“true” or “‘false’’ were tabulated in Tables I and II. Every fre- 
quency distribution for each length of test is based on the judgments 
of one thousand subjects. The class intervals in Table I are in units 
of ten per cent, such as one per cent to ten per cent, eleven per cent to 
twenty per cent, twenty-one per cent to thirty per cent, etc. The 
class intervals in Table II are in units of five per cent. The second 
classification contains data from Table I, but it shows more accurately 
the distribution of scores, particularly for the longer series. Since 
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TaBLe I.—DistTrRisuTions or Scores Basep oN CorrEcT GUESSES FOR THE 
TRUE-FALSE TEsts RANGING FROM 10 To 500 ITEMS PER TEST 





Number of items in the test 





I, per cent 10 items | 20 items | 30 items | 40 items | 50 items | 60 items 








B} | cz: | Bt | ct | Bl crl| Bl ct| Blot Blas 
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Explanations: B'—Frequencies for the respective class intervals when the 
subject limited his guesses to 50 per cent ‘‘True’”’ and 50 per cent “False.” 

C*~—Frequencies for the respective class intervals when the subject guessed 
at random and did not mark one-half of the items ‘‘True”’ and the other “‘ False” 
by actual count. 

I—Class intervals in steps of 10 per cent of the total number of possible answers. 
The ‘‘0 per cent”’ class interval records the frequencies of 0 items guessed correctly. 
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one person guessed zero per cent correctly in the ten-items tests, 
Table I also contains a special class interval for zero per cent guessed 
correctly. The corresponding SD’s and PE’s are also given underneath 
the respective columns. 


Tasup II.—Distrisvutions or Scores Basep ON CorrREcT GUESSES FOR THE 
TRUE-FALSE Tests RANGING FROM 20 To 500 IreEMs PER TEST 





Number of items in the test 





i f Seca 500 
I 20 items | 40 items | 60 {tems | 80 items | 10Q items| 200 items | 300.items| 500 items 
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Explanations: I—Class intervals in steps of five per cent of the total number of answers possible. 

B'\—Frequencies of guesses for each respective class interval when the subject was instructed 
to mark fifty per cent of his guesses “ True"’ and fifty per cent “False.” 

C*—Frequencies of guesses for each respective class interval when the subject was directed to 
guess at random without an attempt to mark as many times “ True” as “‘ False.”’ 


It can readily be observed that in both tabulations there is a definite 
trend which indicates that the longer the test is, the greater is the 
frequency of correct guesses within the range from forty-one per cent 
to sixty per cent of the number of items contained in the test. 

If one applies the formula, “‘Right minus Wrong equals the True 
Score,’”’ to the various scores obtained, many unusually high and low 
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(negative) scores are obtained from the ten-items and twenty-items 
tests. However, in the longer tests, practically every score yields a 
final mark of approximately zero. 

If length of a test is plotted against the Standard Deviation for the 
scores obtained from that test, the resultant curves are negatively 
accelerated and indicate a consistent relationship between the two 
factors. 


CONCLUSIONS 


1. A definite and obvious trend indicates that the longer the test is, 
the greater is the frequency of scores within the class intervals ranging 
from forty-one per cent to sixty per cent of the number of items in the 
test. For the longer tests, practically all scores ranged within forty- 
five per cent and fifty-five per cent of the total test. 

2. Chance guessing may frequently yield very high and very low 
scores in short tests. In the longer tests this probability is practically 
eliminated. 

3. When the formula, ‘‘ Right minus Wrong is the True Score,”’ 
is applied to the longer series, it furnishes a fairly adequate means to 
eliminate the probability of getting high scores by guessing only. 
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TEACHING OBJECTIVES IN EDUCATIONAL 
PSYCHOLOGY 


ROBERT A. DAVIS 
University of Colorado 


Educational psychology has served as a blanket term to include 
the fields of measurement, statistics, child psychology, mental hygiene 
and learning. Colleges and universities offer specific courses for 
treating the subject matter listed under the general term educational 
psychology; consequently students taking courses so designated often 
duplicate previous work. The topic of learning receives little atten- 
tion, whether it is presented in a separate course or as one phase of a 
general course in educational psychology. This tendency may be due 
to the inadequacy of research and to the fact that available studies 
emphasize nonsense syllables, sorting of cards, rote memory and 
motor skills and do not stress those types which are characteristic of 
learning in the classroom. An analysis! of objective studies in edu- 
cational psychology yields evidence that research in learning is ill- 
adapted to the teacher’s use and that results of investigations are 
conflicting and therefore confusing to the teacher of educational 
psychology. 

Although there are obvious limitations to research in learning the 
major emphasis in a course in educational psychology should be placed 
upon learning and those internal and external factors which bear 
upon its development. The courses in educational psychology 
developed for students of education should especially stress learning. 
If it is agreed that educational psychology should direct attention 
to the problems of learning there is the further question of selecting 
the content for a course in this field. In addition to those factors 
which underlie learning ability there are a number of internal and ex- 
ternal influences which should be considered as topics in their relation 
to the ease and permanency of learning. A partial solution to the 
problem of content may be found by an examination and classification 
of objective studies which deal with learning in its various forms under 





1 Davis, Robert A. and C. R. Ballard: The Development of Research in Learn- 
ing. Journal of Educational Psychology, February, 1932. 
See also Henmon, V. A. C.: Needed Research in Learning. Journal of Educa- 
tional Research, Vol. XI, 1925, pp. 313-321. 
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diverse conditions. When such studies are classified according to 
established criteria there is found a large number of not clearly differ- 
entiated classes of subject matter in educational psychology. How- 
ever, the functions of these materials as they are related to the field 
and student have not been clearly defined and outlined, and there is 
no agreement as to objectives! among textbooks in educational 
psychology. As a result there is neither conformity in objectives nor 
subject matter in this field. Objectives should be established with 
two purposes in mind; one set of objectives should be outlined from the 
standpoint of the field of educational psychology itself, and the other 
with the view to making clear the important abilities and skills which 
the subject should serve to develop in students. 

In establishing objectives in educational psychology definite as- 
sumptions must be made. It must be assumed that the student has 
been prepared in basic courses, and in those topics usually covered in 
general psychology. In addition he should possess some of the perti- 
nent facts dealing with physical and mental growth, and emotional and 
social reactions as they apply to childhood and youth. In general 
the student should have some appreciation of the intricate inner life 
of the individual which would create insight and sympathy for the 
growing mind. These are minimum requirements and considerably 
more training in the techniques of experimental research and psy- 
chology would be needed for those students of advanced standing who 
intend to carry forward work for higher degrees in education or psy- 
chology. For advanced students the objectives would necessarily 
differ from those outlined for prospective high-school teachers who 
expect educational psychology to contribute toward their professional 
training. The subject matter of educational psychology is so inextri- 
cably allied with functions and objectives, that no attempt is made 
to differentiate between outcomes realized by the students and the 
content employed. Educational psychology should serve: 

1. To Develop an Understanding and Appreciation of Those Heredi- 
tary and Environmental Factors Which Underlie Learning Ability.— 
Investigations dealing with the influences of heredity and environ- 
ment should be studied and evaluated, the emphasis being placed 
upon mental rather than biological traits. This objective also includes 
a consideration of the nature of improvement, and an evaluation of 





1Good, Warren R.: The Need for Objectives in Educational Psychology. 
School of Education Bulletin, University of Michigan, April, 1931. 
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the theories proposed for its explanation; the inter-relationship of 
mental processes, the nature and significance of intelligence, and the 
character of learning ability. A foundation and point of view may be 
established which will provide the structure upon which later objec- 
tives may be based. 

2. To Provide Bases for Understanding the Nature and Principles 
of Learning and to Supply the Techniques for Guidance in Learning.— 
An interpretation of physiological and psychological learning should 
be considered with the theories of learning, and the laws operating to 
affect economy in the learning process. In addition the student should 
have drill in the application of these learning principles to class-room 
conditions. Educational psychology should develop ability in the 
selection and application of techniques which will elicit the most 
favorable learning responses. This application of learning principles 
should be adapted to the varying degrees of maturity and ability 
of pupils so that there is a progressive adjustment of materials and 
techniques to pupils as they develop through the several stages of 
mental and educational growth. 

3. To Provide a Foundation for General and Specific Methods of 
Teaching.—This objective can also be attained through an under- 
standing of learning principles and laws. Methods of teaching are 
valid only when based upon a knowledge of the ways by which learn- 
ing takes place and those things which influence it. Teaching tech- 
niques consist chiefly in presentation of materials and provision of 
incentives for the attainment of definite ends. The function of 
methodology is to develop techniques for making effective the opera- 
tion of the laws of learning. Although each school subject is probably 
individual enough to justify separate treatment in a course or topic, 
the general laws of learning are sufficiently comprehensive to serve as 
guiding criteria for all learning situations. 

4. To Train the Student in the Recognition of the Different Types of 
Learning and to Develop Ability in the Application of These Types to 
the Various Fields of Subject-matter—Since the principles of learning 
do not apply with equal force to all types of learning, educational 
psychology should develop the ability to differentiate between those 
types of learning which involve predominantly motor responses and 
those which evoke mental reactionsfrom pupils. Again, training should 
be directed to the differentiating process and to applying the types of 
learning to the learning and teaching of school subjects. As between 
motor and mental learning which are not clearly differentiated, the 
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major emphasis should be given to mental learning of the sort which 
is employed by pupils in mastering the abstract subjects of the school. 
Attention should also be directed to the technique of reasoning and 
reflective thinking. Training in the technique of reflective thinking 
is one of the aims of all education but the appropriate place for its 
development is in the field of educational psychology. 

5. To Train the Student in the Recognition and Significance of 
Individual Processes and Traits.—Under this objective is noted the 
part played by such factors as drives, the development of perception, 
observation, interest, and attention. Other factors are the influence 
on ability and achievement of physical traits and various types of 
sensory defects. The bearing of these factors on learning and on 
the adjustment of the child to the school are basic in the study of 
educational psychology and should be correlated with the general 
theme of learning. The aim is to recognize the existence and develop- 
ment of those traits and mental processes which are concomitants of 
child development as they relate to learning and teaching. Although 
these factors are susceptible to training and education, they are 
primarily a phase of native equipment, and may, therefore, be con- 
sidered as predominantly inherent in character. 

6. To Provide Understanding and Ability in the Recognition of 
Those External Factors Which Are Largely within the Control of the 
Teacher and School.—In one case the factors bearing on learning are 
predominantly inherent in the individual and in the other they are 
largely external in nature and within the control of the teacher and 
school. These factors include’ modes of presentation as related to 
reception of stimuli, the use of guidance techniques in teaching, the 
influence of continuous work and fatigue, schedules, motivating 
devices, ventilation, and humidity of classrooms. These influences 
have been studied with varying purposes in mind both in the laboratory 
and school but the important conclusions of investigators may be 
related to the problem of learning efficiency. From the standpoint 
of the function of the school improvement is affected largely by the 
development of techniques of studying and learning and by the 
provision of favorable school environment. 

7. To Provide the Student with Ability and Skill in the Methods of 
Measuring and Evaluating Teaching Efficiency.—Although the student 
may take measurement courses, educational psychology should train 
in measurement as it relates to learning and teaching. Educational 
psychology should not only develop techniques for producing learning 
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but it should also serve to provide ability and skill in measuring the 
effectiveness of teaching devices and methods. Measurement is 
essential in educational psychology and is used in all investigations 
dealing with learning. However, this objective should emphasize 
the significance and use of standardized measuring instruments for 
estimating the amount and quality of pupil achievement for definite 
periods of instruction. Through the use of objective measuring 
instruments, the teacher can test the relative effectiveness of methods 
and techniques of teaching which have been employed. This training 
should deal with the concepts of reliability, validity, and objectivity as 
they apply to measures of achievement in the various school subjects. 
The efficiency of teaching and the degree of pupil achievement can be 
reliably determined only by the use of objective examinations. 


GENERAL OBJECTIVES 


In addition to the foregoing teaching objectives, there are genera! 
objectives which should be observed at the outset of study in educa- 
tional psychology. These objectives include those of information, 
attitudes, skill, and appreciation which in a degree are common to all 
fields of education. Educational psychology should serve: 

1. To Provide the Student with a Body of Factual Data and Methods 
Which Can Be Used in Solving His Own Teaching Problems .—From 
the viewpoint of the use of objective tools and devices educational 
psychology is a pioneer among the fields of education, and experi- 
mental methods of research have been the chief means by which 
problems in this field have been studied. As a result a considerable 
body of scientific material was available before the other fields of 
education had begun to use the techniques of objective research. 
The student should derive a body of factual information from his 
study and possess some knowledge of the methods and techniques 
used by investigators for arriving at the findings from which his 
information is obtained. Too frequently information is secured but 
no understanding of the means by which it is produced and inter- 
preted. Some knowledge of the techniques of experimental research is 
essential for an appreciation of the readings in educational psychology. 


2. To Develop the Scientific and Problem-solving Attitude of Students. _ 


Due to the controversial nature of the problems in the field of educa- 
tional psychology and the refinement of tools and techniques used 
in the study of these problems the conclusions of research studies 
are constantly modified. It is important that the student begin his 
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study of educational psychology with the determination not to 
accept any conclusion as final for which conclusive proofs are not 
available. In no other way can he acquire an understanding of the 
method and spirit of science in education and of educational psychology 
in particular. 

3. To Train the Student to Think Psychologically about Educational 
Problems.—Educational psychology seeks to apply psychological 
principles to the problems of learning and teaching. These psycho- 
logical principles apply in one case to the pupil who is the learner, 
and in the other to the teacher who organizes materials of instruction, 
and stimulates desires so that learning may be effectively guided. 
The student should therefore, think in psychological terms of pupils, 
teachers and methods of teaching as they are functionally and inter- 
dependently related. 


REFERENCES 


Davis, Robert A.: Changing Tendencies in General Psychology. Psychological 
Review, Vol. XXXVI, No. 4, July, 1929, pp. 320-331. 

Davis, Robert A. and C. R. Ballard: The Development of Research in Learning. 
Journal of Educational Psychology, December, 1932. 

Henmon, V. A. C.: Needed Research in the Field of Learning. Journal of Educa- 
tional Psychology, Vol. XI, 1925, pp. 313-321. 

Hertzberg, O. E.: The Opinions of a Teacher-Training Institution Concerning the 
Relative Value of Subject-matter in Educational Psychology. Journal 
of Educational Psychology, May, 1928. 

Watson, G. B.: What Shall Be Taught in Educational Psychology? Journal of 
Educational Psychology, December, 1926. 

Good, Warren R.: The Need for Objectives in Educational Psychology. School 
of Education Bulletin, University of Michigan, April, 1931. 











ot 
he 


ral 
cal 
10- 
er, 
on, 
ad. 
ils, 
er- 


cal 
ng. 
ca- 


the 
nal 


l of 


A SIMPLIFIED STATEMENT OF THE TWO-FACTOR 
THEORY 


W. LINE AND H. B. HEDMAN! 
University of Toronto 
1. PURPOSE OF THIS PRESENTATION 


Students of psychology are, quite properly, brought up in an atmos- 
phere of controversy. From their earliest acquaintance with the 
subject, varying points of view are contrasted and compared, so that 
the subtleties of mental science are increasingly appreciated. Scien- 
tific doubt is fostered ; new systematic positions are examined critically, 
and attacked from many angles; so that he who would express his own 
individuality by formulating an original system of psychological con- 
cepts, must be prepared to defend himself against a vast array of 
spirited rivals, and to convince sceptics of a most obstinate kind. 

This state of affairs is a healthy one, in a field as complex as that 
of human behaviour, where personal opinion is so vitally interested, 
and hence where temperamental bias can be overcome only with great 
difficulty. At the same time, however, it is quite obvious that, if 
carried too far, doubt becomes a handicap; for one’s appreciation of a 
system may be seriously impaired by excessive caution, such as fre- 
quently occurs when the system involves the acceptance of the con- 
clusions of allied sciences. The field of knowledge has become so 
vast, that scholarship of the comprehensive type manifested in a 
Newton, or attempted in a Spencer, is no longer possible. Hence the 
study of any one branch of science involves a combination of faith and 
scepticism. Doubt carried to the Cartesian extreme is likely to con- 
stitute a negative influence, to become an end in itself, and, in the 
realm of psychology, to be other than wholly impersonal. None the 
less, when the major roots of a system penetrate the heart of another 
field, the ambitious student is to some extent justified in demanding at 
least a logical demonstration of the validity of the primary postulates. 

Spearman’s ‘‘Two-Factor Theory” is a case in point. In order 
to establish a scientific methodology that would raise certain controver- 





1The mathematical portion of this paper is mainly the work of the second 
author. 
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sies in the Mental Test field above the level of mere opinion, a new 
approach to the problem of validity was demonstrated in 1904.! This 
approach established the view that the tendency to rely upon a purely 
empirical interpretation of validity,—as followed, for example, by 
those who would be content with the fact that ‘intelligence tests” 
work in practice,—merely represented a recourse to obscurantism, or 
something worse. At the same time, any attempt to go beyond this 
empirical stage towards the truly psychological problem as to the 
nature of the function measured, (the “functional” validity,) was 
doomed to failure, since the controversies thus engendered could 
never be overcome without an adequate methodology. The truly 
“objective” validity could not, prior to that time, be determined. 
Hence the logical approach to the study of mental constitution through 
the field of Individual Differences, involved the discovery and demon- 
stration of a method that would determine the various “abilities” or 
“factors” in a rigidly objective manner. The functional and empirical 
aspects would be later considerations. 

This means that, fundamental to a scientific approach in the field 
of psychology is an understanding of the techniques designed to 
measure degrees of correspondence between functions.?, The mathe- 
matics of correlation must be understood and developed to meet the 
needs of psychological science. Galton implied the same thing when 
he said that a better understanding of the inner constitution of mind 
should be possible;? and Spearman has given concrete expression to 
Galton’s idea, and has shown that upon a rigidly objective foundation, 
a scientific account of functional psychology can be built.‘ This 
latter task involves all the intricacies of the mental realm; but the 
methodology makes them amenable to objective validation. 

The psychologist would be justified in assuming that the validity 
of the mathematical techniques involved, was not his immediate con- 
cern. Just as we accept the formulae of the natural scientist and 
apply them without necessarily understanding their derivation, so 
we might employ statistical devices in evaluating our data without 





1 Spearman, C.: The Measurement of Association between Two Things and 
General Intelligence—Objectively Determined and Reasured: American Journal 
of Psychology, Vol. XV, 1904. 

2 Spearman, C.: Loc. cit., p. 71. 

? Inquiries into Human Faculty, (Everyman’s Library, 1919 Edition, p. 17). 

‘The functional aspect is treated fully in Spearman’s ‘‘The Nature of Intelli- 
gence,”’ etc. (Macmillan, 1923), where the doctrine of Neogenesis is presented. 











is 
ly 
vy 


99 


or 
is 


he 
en 
nd 
yn, 
she 
ity 
n- 
nd 
put 
and 


‘nal 


elli- 


The Two-factor Theory 197 


concerning ourselves unduly with the reasoning on which they are 
based, especially when such devices have been checked by the leading 
experts in the field of mathematics. We must, of course, understand 
them sufficiently to be able to apply them without error. But beyond 
that, we may undoubtedly accept the substantiation of a Filon,' or 
of a Wilson.’ 

Nevertheless, since the mathematical aspect plays so important a 
part in the epoch-making work of Professor Spearman and his school, 
the student may be pardoned for a feeling of discomfort if asked to 
build on a foundation that is, to him, unknown. With this in view, 
the material which follows has been prepared. It constitutes the 
first steps of an attempt to present the basic theorems underlying the 
factor theory, in a manner that can be appreciated by those unac- 
quainted with the higher mathematics. To the expert, there may be 
certain unavoidable limitations in such a procedure; but, it is hoped, 
no false steps in the reasoning are involved. The application of the 
formulae to typical psychological problems has not been considered. 
Such an extension of the Appendiz to Spearman’s Abilities of Man has 
already been made available by Holzinger.* An amplification and 
simplification of the main sections of the Appendiz is here presented, 
to assist in a more comprehensive understanding of the factor theory 
and of its far-reaching possibilities. For those who are interested in 
the more technical presentations of these theorems, and of the con- 
troversies that at one time surrounded them, a working bibliography 
is appended. 


2. PLAN OF THE WORK 


The significance of the sequence here adopted is indicated by the 
following outline: 
Section A 
(a) Introductory statement concerning the meaning of ‘‘Correla- 
tion.” 
(b) The development of a formula for determining the coefficient 
of correlation. 





1See, for example, Proceedings of Royal Societies of London 1922, Vol. 101 A, 
pp. 94-100. 

? Wilson, E. B.: Science, March 2, 1923. 

* Holzinger, K. J.: Statistical Résumé of the Spearman Two-factor Theory, 
U. of Chicago Press, 1930. 
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(c) The effects of errors upon the coefficient of correlation, (r). 
(d) Hence the need for “correlation” of r. 
(e) The Correction for Attenuation. 
(f) The Tetrad Criterion, (as derived from e). 
(g) The significance of the Tetrad Criterion. 
(h) The Reliability of the Tetrad Criterion. 
(t) Freedom of the Tetrad Criterion from the influence of Attenua- 
tion. 
Section B 
(a) Alternative method of correcting for Attenuation. 
(b) Alternative method of deriving the Tetrad Criterion ;—Partial 
Correlation. 
Section C 
Symbols used in this paper. 
Section D 
Bibliography for technical study of the Two-factor Theory. 


SECTION A 


(a) The Meaning of Correlation as Applied to Mental Abilities.—The 
term “correlation” refers to the degree of inter-dependence between 
two mental functions. Expressed quantitatively, perfect correlation 
is signified by a coefficient of 1, perfect independence by 0. For 
inverse correlation the coefficients take similar but negative values. 
On this basis it is possible to express the relationships between func- 
tions or abilities in simple numerical terms. The main relationships 
met in the field of cognitive abilities are positive, and range from 1 to 0. 

The most frequently used expression for determining the coefficient 
of correlation (r) between two functions, is that known as the Pearson 
Product-Moment formula: 


rary 
NG Fy 


(I) 


T = 





The immediate interpretation of this formula may be obtained by 
consulting the table of Symbols employed throughout this paper. 
(Section C, page 218.) We are here concerned with a logical interpre- 
tation of Equation (1), which is best arrived at by showing how it is 
derived. 

(b) Development of a Formula for Determining the Coefficient of 
Correlation.—Suppose we wish to determine the degree of correlation 
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between Ability A and Ability B, (e.g. between “arithmetical” and 
“mechanical” abilities), as measured in a sample of Grade 8 children. 
Suppose further, that the scores obtained in tests of A and B are 
indicated by the distributions z and y respectively. We may plot 
the scores obtained by the individuals in both tests on the same dia- 
gram, by representing the z variable horizontally, and the y variable 
vertically, and thus construct a scatter diagram, or a correlation table. 
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X-variable 


Before comparing the z and y distributions, we must reduce them to 
comparable units; 7.e., we must change the point scores to their values 
in terms of the equivalent amounts of deviation from their means. 
This is done by dividing the deviations (in points) from the mean of 
the distribution by the standard deviation. Thus an z deviation in 
the “‘arithmetical” distribution equals z/c, in terms of its standard 
unit; and a y deviation in the “mechanical” distribution equals 





200 The Journal of Educational Psychology 


y/o, in terms of the corresponding unit. We may now express the 
ratio between any corresponding pair of deviations. 

Let r: denote the ratio between any z deviation in a row and its 
corresponding y deviation, each having been reduced to sigma terms. 
It is obvious that the y value for a row is constant. 

Let n denote the number of z deviations in a row. 

For a single z deviation in a row: 


x ? 
— re 
Cs Cy 
or 
Cs , 
z=r1-y (i) 
Oy 


Summing for a row: 


a o 
Zs = zriy 
y 


2 . 2 
n N Oy 
But 
= 
= = the mean z of the row, 
and 
ca = the ratio of the mean x of a row to the y of that 


row, which value will now be called r, since this is the 
coefficient required. 


Therefore, in any row: 
Mean r= r—y (ii) 


This equation measures the regression of x on y. 
Returning to equation (i), for a single deviation in a row 
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For a row 
o 
Lazy = =Iri—y’? 
Oy 
Dri Cz 
Sry = n— 27/2 
ty N Gy 
Sry = r—ny? 
Gy 
since Dri/n = 7. 


For the whole table: 
>(Zay) = r“2(ny?), 
Gy 


since r, the slope of the regression line xz on y, is constant for the 
whole table. 
But obviously 


y(Lry) = Try 


and 
r(ny?) = Ly?. 
oC 
Ley = r—Ly? 
Ty 
Oz 
= T—N > Oy 
Cy 
Since oy = VY dy?/n. 
=r Mg, 
Lary pry 


~ nawey  \/ Sat Sy = 

This expression gives the coefficient of correlation in terms of x 
and y deviations. 

In the above process, we passed from the equation for a row to an 
equation for the entire table, by the process of summation, and assumed 
that the means of the z scores of the rows lie approximately on a 
straight line, (the regression line); we also assumed that r, being the 
slope of that line, is constant for all the rows. By a similar process 
we can find the regression of y on z: 





Mean y= rove (iv) 


and from this equation we may deduce as before, that r = Zry/nowy. 
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The important function of the regression equation is that of predic- 
tion. Given a child’s ‘‘mechanical” score, we are able, by means of 
equation (iv), to predict his probable ‘‘arithmetical” score, 7.e. the 
average score in arithmetic corresponding to the given mechanical 
score. It should be noted that the coefficient of correlation in itself 
has no predictive value. 

(c) The Effects of Errors upon the Coefficient of Correlation.—We 
may examine the influence of errors upon both numerator and denomi- 
nator of the formula for r derived above: 

‘We saw that r = Yzy/no.c, for true values of the variables in- 
volved. Let the observed values be represented by 


ys 
wry 
T1 — aetiyi 
NGz Fy, 


Consider first the effect of errors on the numerator of the coefficient 
(Zzy): 
w= 2 
Y= y 
oo Zi = TY + Tey, + Yes, + Ca Cy, 
“.201Y1 = Lry + Urey, + Dyes, + Vez ey, 


I+ I+ 


Cz, 
Cy, 


If we assume that there is no correlation between either z or y 
and the errors, and that there is no correlation between the errors 
themselves, then 


zrey, = O 
Similarly 
yyez, = 0 
and 
Zez,ey, = 0 


~.201Yy1 = Try. 
Thus we see that the numerator of the coefficient remains un- 
changed. 





1 The simple proof that the two regression lines pass through the mean of the 
distributions has been omitted in the above development. This proof may be 
found in any good text-book on Statistics. Cf. Yule, Udny: ‘An Introduction 
to the Theory of Statistics,”’ p. 170. 
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Turning to the effect of errors on the denominator (no,0,) of the 
coefficient, we arrive at a auite different result: 


os = _ 
n 





and 
. Dx? 
Oz eS 
n 
D2" 
oz, — f : 
n 
and 
g- 221 
Tz, Re EEE 
n 


m=zrt és, 
z*, = 2* + 2rez, + es, 
yr*, = Tx? + ZFrxez, Lez, 


Applying the same assumption as before, regarding the correlation 
between x or y and the errors, and between the errors themselves, 


yx*; = Zr? + Lez, 








4 2 2 SF p2 
ae - zz 4 ze*s, 

n n n 
“072, = 07, + 07... 


..07,, > o?, and hence az, > az. 
Similarly o,, > cy, so that noz0,, > nowy. 


The denominator is thus increased by errors, while the numerator 
remains unchanged. Therefore Y2.y:/noz0,, is always less than 
~ry/no wy, and consequently r; is always less than r. 

(d) The Need for Correction of r—From the foregoing it is obvious 
that errors of observation always have an attenuating influence upon 
the coefficient of correlation. This being so, the need arises for some 
device whereby the extent of that influence may be determined, or 
else whereby the errors may be eliminated, since our major objective 
is the accurate determination of degrees of correspondence between 
abilities. To this task we may now proceed. 

(e) The Correction for Attenuation.—Let us begin with the expres- 
sion for r in terms of true measures, and convert it into an expression 
in terms of observed measures. We have: 
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22x 
Psy = - y (true measures) 
ry 
and 
D271 1 
Pa i y (observed measures). 
NO 2,0 y; 


Required: r,, in terms of observed measures. 


I. ANALYSIS OF THE NUMERATOR (2ry) 


We have already seen that Sry = Uryy, (page 202). By exactly 
similar reasoning, we may determine the value of Zzy in terms of other 
sets of measures of the same abilities, e.g. x2 and ye. Thus 2; and x, 
may denote the measures for two tests of discrimination, y; and y, 
those for two tests of intelligence, while x and y represent the true 
measures. So that, expressing 2zy in terms of the various combina- 
tions of measures, we have: 
s — FF 
wed eh. (ay) = UX1Y1 “ LxLeYe. (i) 
Try = Tree 

Sry = LIriWe2 


ys “ \ (ey)? = LxryYy2° Urey. (ii) 
aIY = “aT2y1 


Il. ANALYSIS OF THE DENOMINATOR (no.0,) 


Syr2 | Fy2 es 
Noy = mm ov = ~/ >r?dy? 


n n 
As before: 

ri =r t éz, 

Te = 2 Cz, 


= 
. Byte = Z* + Ee, + Bes, + Cole, 
.241%q = Lae? + Vrezr, + TDresz, + Vezez, 


Again applying the assumption that the errors are uncorrelated, 
and that they are independent of z and y, (page 202) 


yzxrer, = 0 
and 

~rez, = 0 
and 

Lez,€z, = 0 
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In a similar way we find the value of Zy? in terms of y; and ye, 
and obtain the result: 


LyiY2 => Ly’. 
Da? Zy? = Laire: Dye 
A Sx? dy? = V SaizeDyiy2 
..The Denominator, nowy = V/ Drizelyy2 








We may now express the value of r,, in terms of 21, 22, yi, y: thus: 
























































>> 
— V TriyilTys (i) 
V SriteDy ys 
or 
_ WS Sry2lreyi - 
Tey = (ii) 
V/ T2122 - LyiWe 
But 
. = 221 = zr ' 
oe Noy, — A/ Say Dy? 
whence 
Tri = Tey,’ V 2212+ Sy? 
And similarly 
ZUX2Y2 2x 12 a 
ta = =e Teg, =n? Teg, = 
Us Sat Sye VJrartre  W/ Sy Bye" 
Substituting these values in equation (i), we get: 
— Nas V >r:?- Zy1") (Tes 222? Dy2?) 
Tey — ————————— 
Nae V 222222) (rye -V Dy Dy2") 
_ Vian: Tews (i) 
VTi Ty Tyiy: 
And similarly, from equation (ii), 
WTam* Tan Tey: (ii) 





ley = 
VTrasr* Tuy Tyiye 
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Dividing (i) by (ii) we obtain the equation 


t = WV Tey: * Tey: — Tam’ Teas 
Jf Tay! toy Tay. * Tam 
- Tay . T zeys ”- Teiys . Tzay- 


This is the Tetrad Equation. 
Multiplying (i) by (ii) we get 


VTray * Troy: * Tay: com 
T 2,227 vive 


4 

“r _ (rev: "Tey: * Tzy2° T x14)” 
ee zy — , 
(Te,2y ° Tyv2)” 


This is the formula for the Correction for Attenuation, given by Spear- 
man in The Abilities of Man. (Appendix, I.) 

(f) The Tetrad Criterion.—In the preceding section, it was noted 
that the Tetrad Equation forms an integral part of the reasoning on 
which the correction for attenuation is based. Later on, (Section B) 
we shall consider the development of this equation from a different 
angle. Our task at this point is to state more clearly what the tetrad 
criterion is, and thence we shall pass to a brief consideration of its 
significance (Sub-section g). 

In the formulae given on page 202, x, and x2 stood for two sets of 
measures of function z, y; and y2 for two sets of measures of function 
y. It was shown that 











a 





Tay, : Tau, 7 Tay, ‘ Toy, 


This expression would be true if x and y were correlated perfectly; 
i.e. if they represented identical functions; for then, by the “‘attenu- 
ation’’ formula, we should have 


” 
T — 1 ae (T2y,72,u,0 2,0," 2,v,)"4 
zy o— _. 

(Te 2.7 v,v,)” 


(a) 





Hence 
% _ 46 
(Tey Tel eu l2,v,) = (rz, zl vy,) 


and since, by hypothesis, x = y, yi and y2 are measures of x. Conse- 
quently we may interchange x2 and y;, and get 


V4 4 
(227 vvTz,v,"z,v,) ‘= (Yau fe.0)” (b) 


Similarly we may interchange z2 and yz and get 


4 
(rv,P2,v,72,2,7v,v,)” = (r2,vtv,2.)’ (c) 
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Dividing b by c, 
(estan) _ (raafesfasyee)" 
(r2.07v,2,)” (re v,T2,v,0 2,27 v,v,)" 
When simplified, this becomes 
Toy Tey, = T2v,"v,2,- (Appendix I, 2.) 








An experimental demonstration of such functional identity was 
given by Spearman in 1904,' in which the tests used were of many 
and varied types in the realm of cognitive abilities. Using his main 
result in illustration of the above presentation, we may take our 
symbols as standing for the following: 

z, = discrimination of musical pitch. 

z2 = discrimination of visual length. 

y1 = power to succeed in school work. 

y2 = intelligence as measured by tests. 
It can be assumed that x: and z2 are measures of some ability z com- 
mon to both discrimination functions—the measures involving, how- 
ever, certain errors ¢éz, and €z, respectively. Similarly yi and ys 


may be said to measure a common function y, with errors ey, and é, . 


The correlation r,,, corrected for attenuation, may thus be deter- 
mined; and Spearman’s results proved this to be practically perfect. 
On this basis, therefore, there is ground for the hypothesis employed 
above in illustrating the meaning of the Tetrad equation. [Appendix 
I, 1.] 

(g) The Significance of the Tetrad Criterion—The foregoing indi- 
cates that, when each of four variables, (z:xey1y2) can be divided 
into two factors each, the one being common to all four, (z or y), 
whilst the other is in each case specific and independent, (ez, €z,, 


ey, ey)» the tetrad equation obtains. This has now been investigated 


over a very wide range of abilities, and its empirical demonstration 
may be said to be beyond question. The reverse problem, namely, 
as to whether, when the tetrad criterion is satisfied, then every variable 
would necessarily be divisible into the said two factors, is much more 
difficult, and will be considered in a subsequent paper. Its solution 
has been achieved by Garnett, and by Spearman.? 





1 American Journal of Psychology, Vol. XV, 1904, pp. 268-272. 
? Garnett, M.: Proceedings of Royal Society, Vol. LXXXXVI, A, 1919. 
Spearman, C.: Proceedings of Royal Society, Vol. CI, A, 1922, and Psychology 
Review, 1920, pp. 167-168. JAppendix I, 3.] 
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The psychological significance of the tetrad technique is most 
clearly portrayed in The Abilities of Man. It offers a method of 
analysing the sub-structure of mind in a rigidly scientific and objective 
manner, and supplies the firm foundation of a meaningful functional 
psychology. The objective and functional aspects of validity open 
up a most promising approach to the entire field of mental tests, and 
make possible the true cooperation of Individual Differences and 
General Psychology. This being so, the practical applications of the 
factor-analysis method are obvious. 

(h) The Reliability of the Tetrad Criterion—We shall here attempt 
to analyze the formula given by Spearman for determining the stand- 
ard deviation of the tetrad difference. Taking, as a typical tetrad 
difference, risr24 — 723714, the standard deviation has the picturesque 
value 


l 
Vins + rig + 723 + r2o4 — [(rieristes + Tie uares + 


TisP asa + Tes eaTsa) + 47 ish aT es a4! 


Let Tis24 (Or TrisTe, — T2a%14) be a tetrad representing the entire 
population, and further, let us assume that the true tetrad difference 
equals zero. In order to obtain an expression for the reliability of any 
observed tetrad difference, we must replace each r by r + dr, where dr 
is;the ratio of change that r undergoes due to inadequacies of sampling. 
It;may increase r, or it may decrease it. 


T1324 = TisT24 — Test 14 (A) 
Tis24 + dises = (Tis + dr3)(T24 + dros) — (T23 + drs) (Ti4 + dri,) 
= T13%2a + Arisroa + Aroatis + drisdros — Testa — 
Ariaro3 — Arostig — Orosdriu 
Subtracting 7'1324 from both sides, and neglecting the second degree 
terms of d’s, (which are of the order 1/+/N and small, when N is 
large) we have 


disea = Orishoq + droatis — Arigre3 — resrig (B) 
Squaring Equation (B), 
d* 1394 = dr°2s??13 + dr’is?724 + dr714?723 + dr’esT714 (C) 
+ 2dreadrisristes — Zdrogdr yar isres 


= dre? i3dres 14 —™ 2dr y3F y4P 24723 
= 2dr j3dre3f 247 14 + Qdry4dro3r 4722 





1 Abilities of Man, Appendix II, 3. 
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Summing, and dividing by N (the population). Equation (C) 
becomes 


Zd*i304 — Ldr*agr*i3 | Vdr*isr22g | Ddr*sur?ss 



































en = ee + a fe (D) 
Ldrosr714 Lr sr isto 
dies etiam 
_— 22dradrurisres _ 92 Er udr ear 13 
N N 
_ 22drisdrurores _ 92dr 13d o37 aT 24 
N N 
Ldrydresr os? 14 
+2 N 
To Simplify 
Sdrtor? 1 — [%.)2r2 
. eee = o%,,rt, = ( + cae 
The next three terms may be simplified by a similar process. 
To simplify ae oo, multiply numerator and denominator 
by ¢,,0,,,, and we get 
ozarudris * Tisloaer, Fr. = 2P page isl 137 240 ra F 743 
No,.6r;; 


The next five terms may be simplified similarly. 
We may now write for Equation (D) 


Ld? 1304 D 
N _ o7,.,7713 + a”, 7704 + o*,., 7723 + em (EB) 





+ 27,,4r,,7137 2407, Or;; 
is 27 rege 7 137 230 1507, 
= 27 rscrag? 137 14017, F124 
— 27 p97, 724 230 7,0 r,, 
— QP y, sras7 247 140 2, Fr 4 
+27, eral 287 140 1 F 15; 


Values for r,, may be obtained by utilizing the formulae of Pearson and 
Filon. Values for o, are easily deduced from the generally accepted 


9 1l-*r 
reliability formula, ¢, = ———- 


vn 
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The Pearson-Filon formulae! for correlation between correlations (r,,) 
are: 


y = ~ 7 T 12123) (724 aa TesT 34) + (r14 = 113134) (Tes “7 1211 13) 
— +(ris re 14134) (24 ms T 127 14) + (ri4 - T1214) (Te3 = T 247 34) 

















271 — r12)(1 — r®m) (I) 
™ _ Tre13(1 + 2rierestis — 1212 — 1223 — 1715 
Tries — 123 2(1 —— 710) (1 — 713) (IT) 
We may now write, instead of Equation (£): 
1 — 22 1 —— 22 — 2 
O retra = ( a P13 + ( — r'24 + ( a 23 
a 24-2 

(1 — r'14 (F) 


+ 24 (Tie ond Toa 14) (134 -~ T1471 13) + (Tes ™ rie 13) (T14 = — 
+ (ria — Tresris)(T34 — Teat23) + (Tes — Pearse) (T14 — 113734) 
2(1 — r°o4)(1 oan 713) 





rate ze ru) (A — 11s) 
VNV/N 


+ (Similar expansion of the 6th term in equation E) 








—22rie — reatua(l + 2reqrietig — 7724 — 7712 — 1714) rials ° 
= 2(1 — r2o4)(1 — 1714) — 
(1 — ro) (1 — ry) 


VNVN 


— (Similar expansion of 3rd, 4th and 5th terms in equation EZ). 

By expanding equation (F) and assuming that the tetrad differences 
(true) are zero, so that risre4 = rieTs4 = 114723, We may then collect 
terms, and, by ordinary arithmetic obtain the following: 





1 
OTetrad = ys + 7744 + 7703 + r24 — 2(risl 127 23 + TioF 147 24 
+ Tish asa + TosPoaTsa) + 4risruarestes] (G) 


(t) Freedom of the Tetrad Criterion from the Influence of Attenuation 
(Abilities, J, 4). Granting the fact that the tetrad criterion holds for 
“‘corrected”’ correlations, does it also hold for the corresponding ‘‘raw”’ 
Correlations? 





1 Philosophical Transactions, Vol. CXCI A, 1898, pp. 229-311. 
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Let 2, 1, 21, W: be any four uncorrected correlations forming a 
rectangle in a table of correlations; e.g. 





xX Y Z W E 














xX — 

Z Y zy21 Yue saat 

W Y 2,0, Y yyw 

E on 























Let x, y, 2, w denote the corresponding corrected or true values. 
We have assumed that rzz° Tyo = Tyz* Tew 


Our question is, will the equation hold for the raw values, 7.e. does 
Tayz f Ty yw, = Ty, 2° Trywo,? 


Let us first reduce the terms of each expression to deviation form, 
and then compare the results: 








Lx@z ‘ 22121 
z= SSS 221 = = 
V/ >x?d2? L21? - Tz," 


and similarly for the other terms. 
For true values, 











Tos * Tao ™ Tos * Tan (1) 
2rz° Lyw Zyz° Zxrw 
oS — = —- (2) 
V rx*t22Zy2tw? Sy? D2? Tz? Zw? 
*, Drz- Syw = Lyz- Tew (3) 


In the analysis of the correction formula, (Sub-section e) it was 
shown that the sum of the products of the pairs of deviations of the 
true values of two variables, equals the sum of the products of the 
pairs of deviations of the observed values (p. 204). 


Therefore: 
2272 = 22121 LSrzw = T2r1wW) 
Zyw = Lyiw1 Lyz = Lyizi 


Substituting equivalent observed values for true values in equation 3, 
we get: 


LarizZ1° Dyyww, = TyizZi° VIT1WwW, 
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Dividing through by +/ 22,22y.?5z;?2w,? 














22321 ° Dyiwy, a LYizZ1° VLW1 
VY 3212 d2;?2 Dy Tw? Dyi2Dey2 Dz Dw? 
or 
Peyz,° Tyyw, = Tyy2, ° Tz, 


In other words, the criterion holds for uncorrected correlations when- 
ever it does so for the corresponding true values. (In Section B, (a), 
an alternative method of proof is offered, the problem being approached 
there from the opposite angle, namely, ‘given that the tetrad equation 
is satisfied with raw values, does it follow that it holds for the true 
correlations corresponding?) 


SECTION B 


(a) Alternative Method of Correcting for Attenuation..—Method.— 
Several measurements are taken of each individual in the two abilities 
to be correlated. The measurements for each individual, and for each 
ability, are then arbitrarily divided into two groups, so that the 
difference between the group averages for an individual is purely 
“‘accidental.”” Such a division may be made by taking measurements 
1, 3, 5, ete. for group one, 2, 4, 6, etc. for group two. 

We many safely assume that the “‘accidental”’ errors in one ability 
are not correlated with those in the other, and that such errors are not 
correlated with the true underlying values corresponding to the 
measurements observed. We may also assume that the averages of 
the true underlying values of the two groups and of the sum of the two 
groups are all equal. Thirdly we may assume that the sum of the 
squares of the individual averages in one group is not very different in 
magnitude from the sum of the squares of the individual averages in 
group two. Symbols employed in this analysis (deviation form) 

Let x denote the average value of the underlying the true measure- 
ments. 
x, the average of the measures for an individual in group 1 
z,* the regular underlying measure corresponding to x, 
d, the accidental error 
212 the average of the obtained measures in both groups. 
xi. the average of underlying true measures for this individual in 
both groups. 
d2 the accidental error. 





1 Spearman, C.: British Journal of Psychology, Vol. III, p. 288. 
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Let xor2'd_ denote similar terms for the same individual in group 2. 

Let yryi'eiyisyi2'€i2Y2y2'e2 denote similar terms for the same 
individual in the two groups; obtained from his measurements in 
respect of the other ability (y variable). 

We may now state our assumptions and equations concisely: 
Assumptions: 

(1) z'; = z's = x'h2 (see page 274 in the article cited) 

(2) z = zy. since the average of the various deviations from the 

average (x!) must equal zero. 
(3) LYaze, Lyd, Xde, etc. 
(4) Xd,? and =d,* are very nearly equal. 
and ~z,? and Yz,? are very nearly equal. 


Equations given: 
QZ, = z'} + dy = 22 + d; and since z'\, = z 


%1 =— Z + d, 
lie = 2+ di 
t2a=2r+d, 
y=KHyte 
Y2o= yt ee 
Yin = Y + Cre 
Then 
Zi2yi2z = (x + dio) (y + €12) 
LLiYi2e = U(x + die)(y + 12) (i) 
Q1t2 = (x + d,)(x + ds) 
22122 = x(x + d)(x + dz) (ii) 
yiy2 = (y + e1)(y + ee) 
Lyiye = Uy + e1)(y + e2) (iii) 


Multiplying (ii) by (iii) and taking the square root: 
[Zxsze- Syry2]”* = [E(x + di)(x + de) Z(y + ex)(y + €2)]* (iv) 
Dividing (i) by (iv) 
BLY ie i D(x + die) (y + e12) 
[LaiteLyiye|’? [D(x + di)(x + de) Z(y + e1)(y + e2)|” 


a L(xy + xreie + ydie + €12d12) 
[Z(a? + rde + xdy + dide) Z(y? + yer + yer + e1€2))”* 


zry 
a/ S22 dy? = Tzy (v) 











= (By assumption 3) 





We now have an expression for rz, in terms of group deviations. 
Our task is to reduce the left hand side of equation (v) to a convenient 
form. 
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If we can express r., a8 a function of rz,,y,,, We Shall have a formula 
of practical value which will correct for accidental errors. 

It is clear that the denominator of equation (v) (left side) must be 
reduced to the form [2z?122y?12]” if we are to get rzy in terms of rz,,y,.- 


Sayin _ = Stayin |: [2712 Zy*aa] 4 
(SrizeDyrye)®  [LVaxr%.2Dy*2)* [VrizeLyrye]” 


- Lr* 12 Dy" 12 —_ (eee 
« ©! zy Zi2y12 Lxyx2VyiY2 Zi2yiz Dr122 Ly we 


5 {7 + a2\’ 
i ia 2 
22122 2I1L2 


_ ms + 2Eziz2 + Taz? 


7 4 22122 


= 1 2x1? ot Dx2” 
a VAC ” 222122 ) 
By assumption 4, 2z,? = 22,’. 


7.21? + Dare? = 2Dr12 = 2 Taj, Tx, = ZW Tarirz2 


de <a ee) 
“"Nyriz V2 2>2x122 


( ) 
1 22122 

=4 | “y+ = 
2 / 221222" 
Similarly 


Ly" 12 . 1 l 
— Pee 
jz# i T 4 
7 Vay = rene 3(3 v wal? + 2 

















But 




































































= Taye" ; ? v( + maa + =) (given thus in the 


article cited) 





% 4% 
=f (1 + Pay) 81 + Tw) (given thus in ‘Abilities’ 
I. 4.) 
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It is again obvious that, if each observed correlation is to be 
multiplied by a factor as indicated in this last equation, the Tetrad 
Difference obtained with observed correlations will not be affected by 
the correction. 

(b) Alternative Method of Deriving the Tetrad Criterion: Partial 
Correlation.—Before showing how the Tetrad Equation is related to 
Yule’s well-known formula for Partial Correlations, it is advisable to 
indicate how that formula may be derived. 

In the case of two variables, there are two regression equations: 


r= rayy—the relation of any mean z to its particular y. 
y 


y = Ty2x—the relation of any mean y to its particular z. 


ox 


Oz. ° ° 
rzy— is the regression coefficient of x on y. 
Cy 


Cy . ° ° 
Tyz— is the regression coefficient of y on z. 
o 


z 


r?,, is the product of the regression coefficients. 


Let us suppose that there is a third variable (z) which influences or 
is correlated with x and y—and is active. Our problem then is to 
eliminate the influence of z. We must therefore first take account 
of its influence on the degree of correlation between z and y. 

Let us, accordingly, develop: 


(1) an expression for the regression coefficient of x on y independent of z, but 
in terms of known values, i.e. in terms of the two-variable coefficient rzy. 

(2) an expression for the regression coefficient of y on z similarly. 

(3) by the product of these two coefficients, an expression for the square of the 
correlation between z and y with z controlled, (r?zy..) in terms of rzy. 


In the case of two variables, the means of the rows of z’s were 
represented by the ‘‘best fitting’ line. With three variables, these 
same means, which were assembled in a line, are now scattered up and 
down in a plane passing through that line. Their various positions in 
that plane mark their z-ness. Approaching the problem by the same 
method as before, we see that, for a single z deviation in a row 

Oz oO 


> om wf 1 ad 
r= PT ey. + 1'ss-y—2. 
zy-z yf %. 





ne ee ERI oe 


ae Re omen 
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For a row 





2x ok Or" wee Cz + Br" ony Gs, 








n nN Gy 8 
O z-yz Oz-ys 

Mean & = Pays Y + Pee-y— 2 (1) 
Oy-zs Oz-zry 


Here we have an expression for the sum total of the mean zr devia- 





tion with respect to both y and z. The term Top expresses the 


y°zz 

relation between the z deviation of a mean and its particular y deviation 
for all values of z, t.e. it is independent of z. Shifting up and down the 
plane does not alter the z or y deviation. It simply adds another 


deviation, that of z. Likewise Teoy expresses the relation between 
Z+zy 

a mean z deviation and its particular z deviation independent of y. 

In equation (1) we have an expression for the regression of x on 
y and z stated in two terms—independent of z in the one term, inde- 
pendent of y in the other. We need an equivalent expression for the 
regression of z on y and 2z, but in terms of r,, and r,,, with z and y 
exerting an influence. These are correlations which we can readily 
obtain experimentally. 


Os. Oz. o fo. 
Let us then reduce r.y.2—— and r,;.,— to ry-— and r,,—; #.e. to 
Oy-zz Oz.zy Oy Oy 


known terms. 
Transposing, in equation (1) 


Ox. yz Oz. yz 














Zz = — CS es 2 
Tos ant y z r ty (2) 
O z-yz 2 on =— Oz-yz 
Pavey ry Toes (3) 
Pye ze et Sy? = Lry — Pasy Dyz (4) 
z-y2 yrzrZ 


But 


Ly? = no,?. 


Oz-ysz Day Oz-yz Lez 

















 Naey-2 = — Trey 9 (5) 
Oy-ze no»? Oz-ay Wy” 
7 Cee Bey Fz _ 4 Taye Uys Oe (6) 
zy-z —_ _ za-y ; 
Sy-08 NDy Cy Or-.zy NO Yr Ty 
Oxz-yz Oz Oz-yz Oz 
l zy-2 a = Tay — Tes-y~ Ta (7) 
Oy-2z Cy Oz-zy Oy 
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(deduced from equation (7) by an obvious shifting of subscripts.) 








° ° ° Cz. P ° 
Substituting accordingly for r...y,—“ in equation (7) 
2-zy 
Ox. o Co og 6: 0 Oz. 
T 2y-s —— = Tsy— ast Tye To + ro —Psy.2—— (8) 
Oy-ze Cy Cy Cz Ty: Oy-ae 


Transposing the unknown “‘ partial’ to the left: 











Or. Co 
(Tees tes oS ena) <= (Tey al PT 2sT yz)— (9) 
Oy-zs Cy 
or 
Cz. o 
Yry-a(1 _ r*..) "= (Tey somes T xe¥ ys) — 
Cy-zs Cy 
. Oz-yz Tey — Tez* Tye2%x 
a ee = _ 
ry-2 "ago Tm: re oy (10) 


The Regression Coefficient of z on y, independent of z, and in terms of 
known correlations. 
By obvious shifting of subscripts, the regression coefficient of y 
on z is 
, Oyzre  Tyz — Pal yz Ty 
ae” 1 — ros Oz 
gt (Tey — TzTy2)” 
o_o Ud— Fai — r,) 
Tey —~ Vel yz 


— V1 — aA | ae ry, 


The partial regression equations may now be written in terms of the 
regression coefficients just obitained: 





(11) 





























Tey — Val yz Fx Vee —~ VayTee Ty 

+ = 2 —" + 2 oo oe (1) 
1 —? yz Ty 1 — y yz Oz 

wl Tyz _ Ty2l xz Oy VT yz ies Tyal zz Oy . y) 

i 2 3S + 1 2 a ( ) 
l-fr zz Oz — Ta Be 

r — £. T: Cz Tey atid Vyzl ez Cz 'en* 

z= — —— att —- 'Y (3) 
1— Yyz Ox l—rsz Oz 


These regression equations are useful for predicting the values in 
one variable from given values in the other two. 
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(The correlation between this predicted value and the observed 
value is the Multiple Coefficient of Correlation.) 

Let us now show the validity of the Tetrad Equation by applying 
the formula derived above. 


Let rzy,.9 denote the correlation that z, would have with y; if the 
influence of g were eliminated. Then, 


Ten —~ Val yg 


rT ted sew nnn ren pane 
_ V1 se raoWV 1 = Ton 








But 
Try,-9 = 0 
- Pa, .¥, = Tzol y9 (1) 
Similarly 
Taw, Yz0T yo (2) 
Tam _ "a9 
Tay Treg (4) 
Similarly it may be shown that 
Taw, _ “ao (B) 


lay, Vag 
From (A) and (B) it follows that 


Taw pore Tarys 
a a? 


Te.u1 T zov. 
or 


Vey lay, = Voy72zy. (Appendix I. 2.) 


SECTION C SYMBOLS USED IN THIS PAPER 


Except where otherwise indicated, the following interpretation 
of symbols holds good: 

X a true score, variable 1. 

Xm the mean true score, variable 1. 
x the deviation of a true score (X — X,,), variable 1. 
Y a true score, variable 2. 

Y,, the mean true score, variable 2. 
y the deviation of a true score (Y — Y,,), variable 2. 


r the coefficient of correlation between true values of variables 
1 and 2. 
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X;, an observed score, variable 1. 
xz, the deviation of an observed score, (X — X;,,) variable 1. 
Y,, y:1 similarly for variable 2. 
r, the coefficient of correlation between observed values of 
variables 1 and 2. 
the error in 7}. 
the error in 7. 


Cz 
Cy 
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THE HEINIS PERSONAL CONSTANT AS A SUBSTITUTE 
FOR THE IQ 


PSYCHE CATTELL 


Harvard University 


There is a growing dissatisfaction with the IQ among psychologists. 
As data accumulates the belief that there is a constant tendency for 
the IQ of the dull child to decrease with increase in age while that of 
the bright child increases is becoming more prevalent. Dr. Kuhlmann 
has recommended Dr. Heinis’ Personal Constant'? in place of the IQ 
for use with his tests. He writes: 


That it has been known for some time that the IQ as found by tests that give 
correct mental ages at all mental levels does not remain constant throughout 
successive years except for children with initial IQ of 100. Intelligence Quotients 
below 100 tend to decrease, and above 100 they tend to increase as the child grows 
older. The amount of change per year depends on the initial IQ and on the 
chronological age. The total rate during the mental growth period may exceed 
25 IQ points. The change is due to the fact that the rate of normal or average 
mental development slows up with increasing years.* 


Combining his own data with those obtained by two other investi- 
gators, H. Heinis secured data on the mental development of children 
ranging in age from two to eighteen years. From the material he 
constructed a formula‘ for a curve of mental growth and a coefficient 
which he terms a Personal Constant, to be used in place of the IQ. 
Since the Heinis Personal Constant (H.C.) ‘is based on the same 
intelligence tests as the IQ, in fact on the mental age derived from the 
identical test performance, it is, of course, not expected to correct the 
variable fluctuations in the IQ. It is, however, claimed that it 
remains more constant than the IQ at the upper and lower levels of 
intelligence. 

In order to make a comparison of the H.C. (Heinis Personal 
Constant) and the IQ, the records of three groups of children were 


? Heinis: H. La loi du Developpement Mental. Archives de Psychol., No. 74, 
1924. 


? Heinis H.: A Personal Constant. Journal of Educational Psychology, March, 4, .* ,. 


1928. 


*“Tnstruction Manual for the Kuhlmann-Anderson Intelligence Tests.’ 
(Forthcoming revision.) 
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taken from the files of the Harvard Growth Study.' The first group 
contained those cases whose IQ’s during the eight years averaged 120 
or higher; the second group those with IQ’s averaging between 95 and 
104, inclusive; and the third, dull children with average IQ’s below 80. 
The average number of tests for each child was 18.2 or 2.4 for each 
year. The tests used were: 


First year: 
Dearborn General Examination A 
Stanford-Binet Individual Examination (to about one-third the cases) 
Second year: 
Dearborn General Examination A 
Stanford-Binet Individual Examination (to about two-thirds the cases) 
Third year: 
Dearborn General Examination A 
Otis Primary Intelligence Test, Form A 
Fourth year: 
Dearborn General Examination C 
Otis Primary Intelligence Test, Form A 
Fifth year: 
Dearborn General Examination C 
Otis Self-administering Test of Mental Ability, Form A 
Sixth year: 
Dearborn General Examination C 
Otis Self-administering Test of Mental Ability, Form B 
Seventh year: 


Dearborn General Examination, Old Form 4 
Dearborn General Examination, Old Form 5 
Haggerty Intelligence Tests, Delta 2 
Stanford-Binet Individual Examination 


Eighth year: 
Haggerty Intelligence Examination, Delta 2 
Terman Group Test of Mental Ability, Form A or B 


In order to avoid, as far as possible, variations in the test results 
due to differences in standardization and the effects of practice each 





1The Growth Study was started at Harvard University under the direction 
of Walter F. Dearborn in 1922-1923. During the first year over 3500 children 
were given physical, intelligence and educational examinations, in so far as they 
can be located these same children have been examined every year since that date. 
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group test IQ was equated to the Stanford-Binet.! The equated 
1Q’s for each child were then averaged. In about one half of the cases 
there was only one IQ available for each of the first and second years. 





Except in a few scattered instances every child had at least two tests 
during each of the remaining six years. 

Thirty-seven cases with IQ’s of one hundred twenty or over, all 
of North European descent, were available. The files contained 





1 The tables used in equating the tests were loaned to the writer by Professor 
John Ratcliffe of Tufts College who is preparing, for publication, a set of percentile 
tables for equating IQ’s based on some 2000 cases. 
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approximately the same number of records of children with IQ’s below 
eighty as of children with IQ’s of one hundred twenty and above, but 
the attendance in school of the dull pupils is less regular and only 
eighteen of North European descent had had an intelligence test for 
each of the eight years. In order to increase the size of the group 
twenty-two Italians and two Jews were added, forming a group of 
forty-two cases. The average IQ and the general trend of the IQ 
curves of the pupils of Italian and of North European descent were 
almost identical. The group of pupils with average intelligence were all 
of North European descent and contained twenty cases. The mental 
age for each child for each year was determined from the C.A. and the 
average of the equated IQ’s; the Heinis Constant from the growth 
units given in the revised manual for the Kuhlmann-Anderson Intelli- 
gence Tests.! 

Table I and the upper heavy broken line in Fig. 1 indicates for each 
year the average IQ of the thirty-seven superior children and the light 
broken line the H.C.’s for the same children. In comparing the 
constancy of the IQ and the H.C. it must be remembered that 
the range of the H.C. is less than that of the IQ. The average IQ atthe 
end of the seventh year was only one point or three per cent further 
above one hundred than it was seven years earlier, on the other hand, 
the Heinis Constant showed a steady downward trend until at the end 
of the seventh year it was only fifty per cent as far above one hundred 
as it was at the beginning of the experiment. 


TaBLE I.—AveRAGE CHANGE IN INTELLIGENCE QUOTIENTS AND HEINIS 
ConsTANTs witH AGE or Dui, AVERAGE AND Brigut CHILDREN 











Dull Average | Bright 
Year 

IQ H.C. IQ H.C. | IQ H.C. 

1 80 88 103 102 129 115 

2 75 85 99 99 128 113 

3 76 86 100 100 128 112 

4 75 87 100 100 128 110 

5 74 87 100 100 130 110 

6 71 87 98 99 130 109 

e 72 88 98 99 130 108 

8 69 87 99 100 130 107 
ee 74 87 100 100 129 110 


























1 At the time of writing the revised manual had not come off the press. 
Advanced sheets kindly supplied by the publisher were used. 
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The first question that arises is why do the IQ’s of these superior 
pupils remain on the average constant, when most other studies have 
indicated an increase with age.':** The most probable explanation 
appears to lie in the fact that the former studies of the constancy of 
the 1Q of bright children which have not been complicated by different 
tests or the effects of practice have been based on Stanford-Binet IQ. 
The present study is based mainly on group test IQ’s. Stanford-Binet 
IQ’s were included in the averages when they were available, but after 
the second year the number of Binet IQ’s were small except in the 
seventh year when they were averaged in with three group test I1Q’s. 

Twenty-four of the thirty-seven superior pupils had a Stanford- 
Binet IQ available for the second year and for either the sixth or 
seventh year of the study. When this test alone is considered the 
expected gain in IQisfound. The mean Stanford-Binet IQ of twenty- 
four cases was one hundred twenty-three when the test was given in 
the second year and one hundred thirty-one when repeated four or 
five years later, a gain of eight IQ points. The Heinis Constant for 
the same pupils based on the Stanford-Binet test changed from one 
hundred eleven to one hundred nine, a change of two H.C. Points, 
or the equivalent of a loss of a little less than six IQ points. The fact 
that the group test 1Q’s remain constant while the Stanford-Binet IQ 
rises indicates the possibility that the gain in Stanford-Binet IQ of 
children of superior intelligence reported in the literature may be a 
function of the particular test rather than of the IQ technique. 

Table I and the dot-dash lines in Fig. 1 gives the average IQ and 
H.C. for a group of forty-two dull children. The downward trend of 
the IQ’s reported in the literature for the Stanford-Binet test is also 
evident in these group test IQ’s. The curve for the Heinis Constant 
is almost horizontal, thus meeting, at this level, the claims made for it. 
But it may be that the downward trend of the I1Q’s of the dull pupils 
is due to the fact that the group tests as well as the Binet are increas- 
ingly scholastic with advancing years and that the dull group lose out 
academically. Thus the decrease in IQ may be a real condition, not 
an artifact and should not be “‘corrected.”’ The IQ of the dull group 





‘Riley, Gordon: ‘‘A Comparison of the Personal Constant and IQ.” Psy- 
chological Clinic, February, 1930. 

* Cattell, Psyche: Constant changes in the Stanford-Binet IQ. Journal of 
Educational Psychology, October, 1931. 

* Cattell, Psyche: ‘‘Do the Stanford-Binet IQ’s of Superior Boys and Girls 
Tend to Decrease or Increase with Age.’’ (Ready for publication.) 
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might show constancy if some other avenues of expression were tested 
(the above explanation was suggested by Dr. W. F. Dearborn). 

The center section of Table I and the dotted lines in Fig. 1 give a 
comparison of the H.C. and IQ for the group of children of average 
intelligence. Both measures remain consistently near the one hundred 
mark during the seven-year period. The slightly greater closeness of 
the H.C. to one hundred is of no significance, it is due to the narrower 
range of the H.C. 

Gordon Riley' made a study of the comparative constancy of the 
Heinis Personal Constant and the Stanford-Binet IQ in three groups 
of feeble-minded children and one group of superior children. His 
results are in agreement with the present study in so far as the feeble- 
minded children are concerned, namely that he finds the IQ to decrease 
while the H.C. remains constant. His results obtained from a group 
of thirty children with I1Q’s averaging one hundred eighteen are in 
contrast to the above reported group test results of children with 
IQ’s averaging one hundred twenty-nine, in that he found the IQ 
to increase and the H.C. to remain constant. However, the differences 
between his results and our Stanford-Binet results are small and it may 
be that they could be accounted for by the fact that his group of 
superior children were of the lower order of intelligence than those 
included in the present study. 

As a check of the above emperical findings hypothetical cases were 
selected with H.C.’s of 65, 70 . . . 135, and the 1Q’s that would of 
necessity result if the H.C. were constant from year to year calculated. 
This was done by looking up in Dr. Heinis’ table the mental age at 
different chronological ages of a pupil with the given H.C., then the 
IQ was calculated based on the mental age thus obtained and the 
chronological age. The changes that would result in the IQ are shown 
in Fig. 4. It has not been proven that the intelligence quotients of 
bright children, when calculated by the standard method remain on 
the average constant, but there is no evidence to indicate that the very 
high IQ’s increase to such an extent as these curves show that they 
must if the H.C. were constant. A child that had a constant H.C. 
at 110 would have an IQ of one hundred seventeen at the age of six, 
which would increase to one hundred thirty-four at twelve years and 
one hundred eighty-five at fifteen; a child with a constant H.C. of 
one hundred fifteen would increase in IQ from one hundred twenty-six 





1 Riley, Gordon: “A comparison of the Personal Constant and IQ.” 
Psychological Clinic, February, 1930. 
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at six years of age to one hundred seventy-nine at twelve years and 
would be up in the four hundreds by fifteen years. 

The extent by which the low IQ’s must decrease if it were to be 
assumed that the Heinis Constant were constant is from ten to twenty 





Fig. 2.—Changes that would take place in the IQ if the Heinis Constant were constant. 


points as indicated in the lower part of Fig. 2. This is not beyond 
the realm of reason. The dull group in the present study decreased 
eleven points in IQ between the ages of about seven and fourteen. 
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Dr. Heinis put his mathematical conclusions to a test using the 
results of Binet tests on feeble-minded children reported by Dr, 
Kuhlmann.' He found the facts to fit with his theoretical conclusions, 
but unfortunately no such test was made with superior children. Our 
results also indicate that the Hienis Constant may be constant for 
children of subnormal children. The large discrepencies lie at the 
other end of the scale. 

The present study points to the conclusion that: (1) The Heinis 
Personal Constant is more constant than the IQ for children below 
average in intelligence, at least when based on linguistic test, but that 
for children of above average intelligence it is less constant than the 
IQ. (2) There is some indication that the gain in Stanford-Binet IQ 
of superior pupils with age may be a function of the particular test 
rather than of the IQ technique. 





1 The Results of Repeated Mental Re-examinations of 639 Feeble-minded over 
a Period of Ten Years. Journal of Applied Psychology, Vol. V, pp. 195-224. 
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THE DIFFICULTY OF A MULTIPLE CHOICE TEST 
ITEM 


PAUL HORST 
Personnel Research Division, Proctor and Gamble Company 


We shall define the difficulty of a multiple choice test item as being 
a function of that proportion of individuals answering the item which 
knows which of the alternatives is the best answer. This definition 
involves the assumption that there is some objective criterion which 
determines that one particular alternative is a better answer to the 
item than any of the others. The nature of this criterion and the 
logic involved in the technique for constructing it will not be discussed 
in the present article. 

In the following analysis we simplify our problem by assuming that 
the text of the item involves a single idea, and that the alternatives 
contain the same idea to varying degrees. We assume further that 
the group of individuals answering the item is distributed according 
to some definite analytical frequency function with reference to ability 
to discriminate between varying degrees of the idea. That the text 
may contain more than one distinct idea and that each alternative 
may contain the same ideas in varying degrees is a condition which 
exists in many cases. Thus it is possible that with reference to one 
idea in the text the alternatives may take one order when arranged 
serially according to the degree to which they express that idea, while 
with reference to another idea they may take quite another order. 

Further, it is easily possible that the group of individuals answering 
the test item when distributed according to ability to discriminate 
between varying degrees of one idea may take quite another distribu- 
tion when distributed with reference to ability to discriminate between 
varying degrees of another idea. 

Suppose now we have a multiple choice item which involves essen- 
tially but one idea. We have n alternatives which express the same 
idea to a greater or less degree. We represent each alternative by 
the amount of discriminatory ability required to differentiate between 
the idea expressed in it and the idea expressed in the text. These we 
indicate by 2x1, x2, to x, where x; requires the least discriminatory 
ability and z, the most. 
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Now suppose we have a group of individuals distributed according 
to ability to discriminate between varying degrees of the idea expressed 
in the text. Let us represent this distribution by f(z). 

Then none of the individuals below zx, will be able to differentiate 
between the ideas expressed in any of the alternatives and the idea 
expressed in the text. This proportion of individuals may be repre- 
sented by | f(x)dzx where Xp is the lower root of f(x). Let us indicate 
by S:, Se, . . . S, the proportion of individuals marking each alter- 
native. We assume that all individuals below 2; in ability will be 
equally likely to mark any of the alternatives, so that the proportion 


1 1 
below X; marking each alternative will be 7 | J(x)dz. The proportion 


of individuals between z; and z- in ability is | f(x)dz and these will 


JT 


be equally likely to mark any of the alternatives above z, so that of 





° 1 Z2 
this proportion >—j i} f(x)dx will mark each alternative from 2, 


1 Xa 
to tn. Similarly, of the proportion between z2 and z;, noo) Jade 


will mark each alternative from x; to z,. Thus we have, 











Si = : Saas 
1 2 
S2=— Ea 1 sorerry F f(x)dz 
S.1= 5 | fea + wai { Foe ee “Fix | (1) 
S, = oe a + wat { See deo es 
+5 [ Fade + ( f@de 
In-2 In-1 


where x in the upper limit of the last term of the last equation is the 
upper root of f(z). It should be noted that this last integral includes 
all individuals between z,_; and z, in ability, as well as those beyond 
Zn. This includes therefore, all individuals who know that none of 
the first z,_; alternatives is a best answer plus those who know that 
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z, is the best answer. This distinction, while not of much practical 
importance, is of considerable theoretical importance. 

If we subtract the next to last equation in (1) from the last equa- 
tion, we get, 


8. —Sar= {fedex ae 


In-1 


which means that the proportion of individuals who know the best 
answer is the difference between the highest proportion marking an 
alternative and the next highest proportion marking an alternative. 

Since S, — Sp-1 gives a definite numerical value z,_; can be solved 
for in (2) providing f(z) is given. Thus the difficulty of a multiple 
choice test item is a function of the degree of discriminatory ability 
required to differentiate between the idea expressed in the text and the 
alternative marked by the next to highest proportion of the group. 

If the alternative which the largest proportion of the group mark 
is not, according to certain objective criteria, the best answer, then 
we must assume that the text contains several ideas and that the group 
is not discriminating with reference to the idea for which the best 
answer has been objectively determined. This situation requires a 
more detailed analysis which will be set forth in a separate article. 

Now if we have a very large number of cases, we may define our 
unit of measurement of discriminatory ability so that the distribution 
is normal, and adjust this unit so that it becomes the standard devia- 
tion of a normal distribution. If we adopt as the origin of the dis- 
criminatory scale the mean of the group, we may write (2) 


ee eee f o tae (3) 


n-l 


and from a table of the normal probability integral we may find the 
value of X,_; in (3). This gives us the difficulty of the test item 
measured from the mean of the group in terms of its standard deviation. 

It should be noted from (1) and (2) that the proportion of individ- 
uals which marks the best answer by chance is the same as the pro- 
portion which marks the next best answer. From this it is clear that 
for a test item of any given degree of difficulty the proportion marking 
the best answer will contain the least possible proportion due to chance 
when all alternatives but the best answer are of equal difficulty, for 
with the proportion marking the best answer remaining constant, the 
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proportion marking the next best answer must be reduced as the other 
alternatives approach it in difficulty. 

This fact is of considerable importance in constructing multiple 
choice test items, for it is evident that fewer alternatives, equal in 
difficulty, may result in a smaller chance element in the answer than 
is given by a greater number of alternatives presenting a wider range 
of difficulty. 
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COMMUNICATIONS AND DISCUSSIONS 


IN DISAGREEMENT WITH E. A. LINCOLN’S ARTICLE “THE 
UNRELIABILITY OF RELIABILITY COEFFICIENTS” 


LUTON ACKERSON 
Institute for Juvenile Research, Chicago 


Professor Lincoln’s article with the above provocative title! has 
already been discussed by Franzen and Derryberry.? While agreeing 
entirely with these critics, I feel that more should be added, with the 
aid of further figures from Lincoln’s data which he has generously sent 
me in full knowledge that I would endeavor to use them to refute his 
thesis. He writes that ‘‘the median difference for boys’ sternal height 
is 6.2 mm., and the reliability coefficient is .994, while the median differ- 
ence for boys’ head width is 1.7 mm., with a reliability coefficient of 
only .591. The same inconsistency [italics mine] is found in the results 
for the girls. Here the median difference in sternal height is 5.0 mm., 
with a reliability of .995, while in head width the median difference is 
1.7 mm., and the coefficient is .694”’ (p. 13). He concludes that since 
the higher reliability coefficient is accompanied by larger absolute 
differences between two series of measures in his material there must 
be something wrong with reliability coefficients as a device for demon- 
strating the agreement between parallel series of observations. 

His error lies in attempting to compare absolute raw values in 
different fields of data without regard to the respective variabilities of 
these fields, as Franzen and Derryberry point out. The most satis- 
factory method of rendering different series of measures comparable 
is by reducing the original scores to ‘‘standard measures,”’ a procedure 
long known to statisticians and adequately described in Kelley’s*and 
Holzinger’s‘ texts, but unfortunately still a custom of the future so 
far as most of us biometricians and psychometricians are concerned. 
Although Lincoln’s data were not presented in the form of standard 
measures, we are safe in inferring that by dividing his median differ- 





! Journal of Educational Psychology, Vol. X XIII, 1932, pp. 11-14. 

*R. H. Franzen and C. M. Derryberry: Journal of Educational Psychology, 
Vol. XXIII, 1932, pp. 559-560. 

*T. L. Kelley: “Statistical Method.”” New York: Macmillan, 1923, pp. 
114-117. 

‘*K. J. Holzinger: “Statistical Methods for Students in Education.” New 
York: Ginn, 1928, pp. 118-122. 
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ences by the appropriate standard deviations, we may obtain a fair 
estimate of what the results would have been if standard measures 
had been used. 

Among his seventh-grade boys the standard deviations of sternal 
height and head width were about 72.0 mm., and 5.7 mm., respec- 
tively.1_ In terms of standard measures, then, the median or average 
differences between the two sternal-height series and between the two 
head-width series are not far from .09 and .30, respectively. That is, 
the discrepancy between the two series of head-width measures is about 
three times as great as that between the two series of sternal-height 
measures. When we recall that the reliability coefficient for head 
width was only .591 while that for sternal height reached the high value 
of .994, we see that, after all, the reliability coefficient gives a truer 
picture of agreement or disagreement then does the mere comparison 
of absolute ‘actual differences,” as Lincoln terms them. 

Among his girls the reliability coefficient similarly provides a more 
meaningful index of the adequacy of the measurement-taking. The 
standard deviations for sternal height and for head width were about 
72.0 mm. and 8.5, respectively, and the median or average differences 
in terms of standard measures would be approximately .07 and .20, 
respectively, while the corresponding reliability correlations were .995 
and .694. Here again, the reliability coefficients serve as the better 
indicator of the accuracy or consistency with which these traits were 
measured. In other words, head width in his data was being measured 
much less adequately than the vertical anthropometric measures, in 
spite of the smaller “‘actual differences’’ which were found in the head- 
width series. 

In fact, to follow Lincoln’s criterion that the degree of reliability of 
two series of measures may be inferred from their actual differences, 
which in a statistical sense are only arbitrary numerical values, would 
lead to absurd conclusions. For example, nasal length measured in 
centimeters would thereby be a much more reliable measure than total 
stature measured in centimeters, regardless of how clumsily nasal 
length may be measured, merely because the figures representing the 
raw differences would be smaller. Or similarly, one could argue with 
equal correctness or incorrectness that measurements of stature when 
recorded in inches would be more reliable than when recorded in centi- 
meters because the actual raw differences were smaller. 





1From a personal communication from Professor Lincoln. 
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In short, the present writer’s perusal of Professor Lincoln’s data 
has not altered his belief that there is no substitute for reliability corre- 
lations in studies based upon variate data. 





RELIABILITY COEFFICIENTS ARE STILL UNRELIABLE 


EDWARD A. LINCOLN 
Harvard University 


After the publication of an article entitled The Unreliability of 
Reliability Coefficients,! the writer received several letters pointing out 
alleged fallacies in his argument. Also, two or three articles have 
been published which take the same stand.? Perhaps another brief 
word will be of interest to the critics and others who are concerned with 
this problem. 

The chief contention of the critics is that the reliability coefficient 
is a better measure because it is a standard measure; that is, it is inde- 
pendent of the absolute raw values of the measures taken. The 
latter part of this statement is, of course, true. It is also true, how- 
ever, that in some instances the absolute rather than the relative values 
are important to the investigator, and then a relative measure is 
neither valid nor useful. That was the case in the illustration cited in 
the original article, and the same argument would apply in many 
other cases. 

It should be remembered that the same characteristic of the relia- 
bility coefficient which makes it very serviceable in some instances 
renders it misleading and worthless in others.* This, as every real 
statistician knows, holds true for every statistical measure which we 
have. That is, no measure is always and inherently better than any 
other; it is only better to show some things. 

Let us consider again the problem with which the original article 
dealt. The writer was seeking an answer to the question, Is the 
technique of measuring heights employed in the Harvard Growth 


1 Journal of Educational Psychology, Vol. XXIII, 1932, p. 11-14. 
? Franzen, R. H. and C. M. Dewberry: Journal of Educational Psychology, Vol. 
XXIII, 1932, pp. 559-560. 
Heilman, J. D.: The Teachers Journal and Abstract, Vol. VII, 1932, p. 93. 
Akerson, L.: Journal of Educational Psychology, Vol. XXIV, No. 3. 
* Heilman makes this point very aptly. He says, ‘‘No one misinterprets the 
coefficient of reliability who remembers that it is intended to measure only 
variable errors and that it varies with the standard deviation of the distribution.” 
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Study giving measurements of the individual pupil which are accurate 
and dependable? The reliability coefficients of .99 plus said, Yes. 
When, however, a study was made of the actual differences in two sets 
of measurements it was discovered that in nearly half the cases there 
was a difference of half a centimeter, and that there were many differ- 
ences of a whole centimeter or more. Clearly the reliability coef- 
ficients in these cases were very misleading, and no one has successfully 
refuted this important point. In short, the critics have explained why 
the difficulty occurs, but they have not explained it away. The 


writer has neither seen nor heard any argument which moves him to 


change his original position that reliability coefficients when used 
in such a study are very unreliable. 





STUDENTS’ RATINGS OF LECTURES 
1. COMPARATIVE RATING OF GENERAL 
PSYCHOLOGY LECTURE PARTS 


NATHAN ISRAELI 


University of Maine 


AN INTROSPECTIVE STUDY 


Not all parts of a lecture appeal alike or are equally interesting. 
Just how students rate comparatively lecture parts on the basis of 
interest can be measured. This study is based on subjective or intro- 
spective judgments of students. Incidentally, it leads to a comparison 
of interest ratings of textbook and non-textbook lecture material. 
Demonstrations, delineoscope illustrations, material on the textbook, 
non-textbook material, data from the lecturer’s experiments are five 
parts of lectures in a General Psychology course rated by students for 
interest. 


A LECTURE RATING QUIZ 


Two different classes in General Psychology were given the follow- 
ing lecture elements rating quiz: 





This quiz is for pedagogic and scientific purposes. Make your 
answers express your true opinion. 

Here are five parts of the lectures in this course. You are to 
number each item in accordance with your opinion of how interesting 
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they are. The most interesting item should be numbered 1, and the 
least interesting item should be numbered 5. 








WRITE 
NUMBER 
HERE IreMs OF THE GENERAL PsycuoLocy LecTuRES 
A. Demonstrations! 
——— B. Delineoscope illustrations 
——— C. Lecture material on the textbook 
——  D. Non-textbook lecture material? 
—— E. Lecture material from the present lecturer’s own experiments, etc. 
Group A, sixty-six subjects, and Group B, thirty-eight subjects, 


included mostly sophomores, had the same instructor, and were not 
equivalent groups. 
RESULTS 

The most interesting part of a General Psychology lecture consists 
of demonstrations. Both groups think this is so. For Group A, 
next in order of interest are the delineoscope illustrations, non-textbook 
lecture material, textbook material, and lecturer’s illustrations; and 
for Group B, lecturer’s illustrations, non-textbook, textbook material, 
delineoscope. This is shown in Table I. 


TaBLE I.—Most INTERESTING LECTURE ELEMENT 





Percentage rating a lecture-part 


as the most interesting one! 
Lecture-part 











Group A Group B 
oc we owes | 6 5 
. Delineoscope illustrations............. | 1 
. Non-textbook lecture material......... 1 
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Lecture material on textbook........... | 
. Material from lecturer’s experiments . . 7 
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1 Taken from the first of a five column order-of-merit table for each group. 
Data in this table are taken from the first column of order-of-merit tables not 
included here. 


Another way to look at the results is to take the ranking item in 
each of the five columns of the order-of-merit table and we find the 





1 For example, the demonstration of performance tests, demonstration of 
individual differences in emotional reactions to music, etc. 

? Part of the lecture covering material not in — 
in detail. 
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following order for Group A: demonstrations, delineoscope, lecturer’s 
material, lecturer’s material, non-textbook. For Group B: demon- 
strations, delineoscope, delineoscope and textbook, non-textbook, 
textbook. According t0 this series of ranking items in each of a 
five column order-of-merit table, demonstrations rank as the most 
interesting lecture element. 

Weighting each column in the order-of-merit table gives a com- 
bined score from the different columns for each one of the five lecture 
items. Column one is multiplied by five, two by four, three by three, 
four by two, and five by one. Total scores are computed for the 
different lecture items on the basis of frequency and not percentage. 
The total scores, given in the first two columns of table IV, are 
reduced in the last two columns to a comparable basis, based on lecture 


TaBLE II.—ComparaTIvE RATING oF GENERAL PsycHoLoGy LEcTURE Parts 











Total score | Rank order Comparative 
rating 
Lecture-part 

Group} Group/ Group} Group/Group| Group 

A B A B A B 

! 

A. Demonstrations................ 283 151 1 1 100 100 
B. Delineoscope illustrations........ 185 100 2 4 65.4) 66.2 
E. Lecturer’s illustrations.......... 183 127 3 2 64.7| 84.1 
D. Non-textbook lecture material...| 172 112 4 3 60.8) 74.2 
C. Lecture material on textbook....| 167 80 5 5 59.0; 53.0 























elements A (demonstrations) as one hundred. The middle columns 
show the new rank order of the General Psychology lecture parts for 
each group. 

In all instances, practically, the non-textbook ranks higher than 
the textbook material. To contrast further these two important 
lecture parts, the total scores of part #, lecturer’s material, and part 
D, non-textbook material, are combined together and the new total 
score may be compared to that of part C, the textbook material. The 
new total score for the combined non-textbook lecture material is 
three hundred fifty-five for Group A and two hundred thirty-nine for 
Group B. Or, the combined non-lecture material is 2.13 times as 
interesting as the textbook material to Group A and is 2.99 times as 
interesting as the textbook material to Group B.! 





1 This treatment of the data suggests further work on a comparison of textbook 
and non-textbook material. In a different experiment, it was found that Organic 
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SUMMARY 


Five parts of a General Psychology lecture courses were rated for. 
their interest by two different sections in that course. The most 
interesting part of the lectures consists of demonstrations and the 
least interesting part, of the textbook material. Combined non- 
textbook material is 2.13 and 2.99 times as interesting as the textbook 
material to the two different classes. 





Chemistry students rate the lecture on the textbook material as less valuable for 
instruction than the lecture on non-textbook material. 
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BOOK REVIEW 






The Vibrato. Lowa City: University of lowa Studies in the Psychology 
of Music, New Series No. 225, April, 1932. 


This is a thorough and excellent piece of research work in a field 
much in need of experimental investigation. Following closely the 
pioneer experimental and more localized work of Cheslock,! it unravels 
further the so-called mysteries of this “soul of the musician’s tone.” 
The University of Iowa volume, edited by Carl Seashore, is a collection 
of studies by eleven investigators. By far the greater part is devoted 
to voice; in fact, the entire volume could perhaps on this account 
have been more appropriately titled Voice Vibrato. 

The effects of pitch, intensity and periodicity, the three com- 
ponents of vibrato and their interesting and significant inter-relations 
are analyzed in detail with ample diagrammatic illustrations. The 
findings show that fluctuations in these components are intimately 
linked with our concept of timbre and quality; that the vibrato can be 
modified through training by directing the attention to certain aspects 
of the problem; that vibrato itself varies considerably with the tonal 
and emotional environment in which it is used. The very careful 
methods of procedure used and the specific nature of the individual 
problems give the results dependability and conciseness. In addition 
there is a historical summary of established facts in experimental 
studies on vibrato brought up to date, detailed description of phono- 
photographic apparatus used, comparative study of student vibrato 
and artist vibrato, of concert singing and opera singing, of attack, 
release and transition in vibrato. The volume contains a wealth o 
significant and interesting material. A condensed, non-technica 
summary of these studies will be helpful to music students and teachers 
not versed in psychological technique and procedure. 

OTTO ORTMANN. 













Peabody Conservatory of Music, Baltimore. 





1 Cheslock, Louis: “Introductory Study on Violin Vibrato.’”’ Peabody Con 
servatory Research Studies. 
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